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NEW PLANT GENES 

FIELD OF THE INVENTION 

5 This invention relates to glutathione transferase 

(GST) subunits, to nucleic acid sequences encoding 
glutathione transferase subunits, and to uses of these 
glutathione transferases and coding sequences, especially 
in the field of plant biotechnology. 

10 

BACKGROUND OF THE INVENTION 

Glutathione transferases (GSTs, EC. 2.5.1.18), also 
referred to as glutathione 5-transf erases , are 
15 multifunctional enzymes capable of catalysing the 
conjugation of electrophilic substrates with the 
tripeptide glutathione (GSH, gamma- 

glutamylcysteinylglycine) . The electrophilic substrate 
may be of natural or synthetic origin, examples including 

20 endogenous stress-metabolites, drugs, pesticides and 

pollutants. Conjugation with GSH renders the compounds 
non-toxic and suitable for export from the cytosol and 
further metabolism. In addition to their activities in 
GSH conjugation, GSTs may have additional activities as 

25 glutathione peroxidases, catalysing the reduction of 
organic hydroperoxides to the corresponding alcohol 
according to the reaction: 

R-OOH + 2 GSH > R-OH + GSSG. 

30 

All known active GSTs are composed of two 
polypeptide subunits, with each subunit possessing a 
binding site for GSH and the electrophilic co-substrate. 
The two subunits may either be identical, giving rise to 
35 a homodimer, or dissimilar giving rise to heterodimers - 
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GSTs may therefore be defined according to their source, 
or classr and their component subunits according to the 
nomenclature SpGST x-y, where Sp = source or class of 
GST; X and y describe the subunit types. 

5 

Each discrete subunit is encoded by a distinct gene, 
with many eucaryote containing GST multigene families 
encoding multiple isoenzymes . 

10 The plant in which GSTs have been characterised in 

the greatest detail is maize (Zea mays L.)- The major 
maize GSTs are composed of three discrete subunits, 
termed I, II and III. These subunits associate together 
to form three isoenzymes containing the Zea mays GST I 

15 subunit, namely ZmGSTI-l, ZmGSTl-lI and ZmGSTI~lll as 

well as the homodimers ZinGSTII-II and ^inGSTIII-III . The 
nucleotide sequences of ZmGSTl, ZmGSTll and Z/nGSTIII have 
been determined. In view of their relatedness in 
sequence, these maize GSTs have collectively been termed 

20 type 1 plant GSTs- Additional maize GSTs with activities 
toward herbicides have been described as ZjtiGSTV-V and 
2i7?GSTV-VI . The sequence of ZmGSTV differs markedly from 
the other maize GSTs described to date, resembling the 
auxin-inducible GSTs from dicotyledenous plants which 

25 have been termed the type 3 GSTs. 

The maize GST subunit types are associated with 
differing substrate specificities. The ZmGSTl subunit has 
broad-ranging, but low, activities toward chloro-s- 

30 triazine, chloroacetanilide and diphenyl ether 

herbicides. The ZmGSTll and ZmGSTIII subunits show 
greater specificity toward chloroacetanilides , while 
ZmGSTV and ZmGSTVl are highly active toward diphenyl 
ethers. The GST isoenzymes differ in their patterns of 

35 expression in the organs of maize. Thus, ZmGSTl-1 and 



ZjnGSTV-V are expressed in all plant parts, while ZmGSTl- 
II is root specific. The expression of the GST subunits 
is also differentially affected by herbicide safeners. 
These are compounds which enhance the tolerance of cereal 
crops to herbicides, in part, by increasing the 
expression of detoxifying enzymes such as GSTs . Thus, the 
Z/nGSTII and ZmGSTV subunits accumulate in maize seedlings 
following treatment with the safeners dichlormid or 
benoxacor while the ZmGSTl and ZjnGSTIII subunits are only 
modestly enhanced by safeners. 

Far less is known regarding GSTs in plant species 
other than maize. GSTs with activities toward non- 
herbicide substrates have been identified in some plants, 
and mRNAs apparently encoding GSTs have been shown to be 
expressed in plants including carnation, tobacco and 
thale cress (Arabidopsis thaliana) . However, isoenzymes 
with activities toward herbicides have only been 
definitively identified in soybean, pea and pine trees. 
Of these, only in soybean has the nucleotide coding 
sequences of the herbicide-detoxifying GST been reported. 



GSTs in plants have also been shown to have 
secondary activities as glutathione peroxidases, able to 
reduce organic hydroperoxides, such as fatty acid 
hydroperoxides to the corresponding monohydroxy alcohols. 
GSTs with glutathione peroxidase activity have been 
isolated from peas, soybean. A, thaliana and wheat 
flour. Since fatty acid hydroperoxides are a common 
result of membrane peroxidation imposed during oxidative 
stress, glutathione peroxidases provide an important 
cytoprotective function in preventing the accumulation of 
fatty acid hydroperoxides and their subsequent 
degradation to toxic aldehydes. Glutathione peroxidases 
may therefore have a vital function in protecting plant 
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cells from oxidative stress. The intervention of 
glutathione peroxidases in lipid peroxidation has also 
been cited as a determinant of flour quality in wheat . 

5 Of particular relevance to this invention is the 

lack of knowledge concerning the GSTs of wheat (Triticum 
aestivum L . ) • 

Some information is available from experiments on 
10 whole plants and plant extracts. Several herbicides 

including examples of the chloroacetanilides , as well as 
dimethenamid and f enoxaprop-ethyl undergo GSH conjugation 
in the course of their detoxification in wheat. Also^ in 
crude plant extracts GST activities toward 
15 chloroacetanilide herbicides and f enoxaprop-ethyl have 
been demonstrated . 

However, there is only one previous report of the 
purification of a GST from wheat. This GST, purified from 
20 wheat flour, was described as a homodimer of 27.5 kDa ^ 
polypeptides with activity toward the non-herbicide 
substrate l-chloro-2 , 4-dinitrobenzene (CDNB) and 
glutathione peroxidase activity toward fatty acid 
hydroperoxides . 

25 

Moreover, very little is known regarding GST genes 
in wheat. An mRNA originally described as wir5, which 
showed sequence similarity to the type 1 maize GSTs, was 
identified as accumulating in wheat leaves during the 

30 onset of acquired resistance to powdery mildew (Erysiphe 
graminis) . The gene was termed gstAl and shown to be 
similar in genomic organisation to maize ZmGSTl . The 
gstAl polypeptide was expressed in recombinant bacteria 
and shown to have an apparent molecular mass of 29 kDa. 

35 The respective enzyme showed GST activity towards the 



non-herbicide CDNB, though the activity toward other 
substrates and activity as a glutathione peroxidase was 
not reported. An antibody was raised to the recombinant 
GstAl and used in Western blotting experiments to show 
that this GST was specifically induced in wheat leaves by 
pathogen attack. In contrast, a distinct class of GSTs 
composed of 25 kDa and 26kDa subunits, which were 
recognised by an antiserum raised to undefined GSTs in 
maize, accumulated following exposure to cadmium and the 
herbicides atrazine, alachlor and paraquat. The 
activities of these xenobiotic-inducible GSTs in wheat 
and the corresponding nucleotide sequences were not 
reported . 

Thus, although wheat is an important crop plant, 
there has been almost no molecular characterisation of 
wheat GSTs or their genes and, to date, only one purified 
GST and one GST gene sequence is available, gstAl . 

Significantly, the purified recombinant GST protein 
expressed from gene gstAl was not reported to exhibit 
activity towards herbicides. Hence, none of the previous 
work on wheat GSTs actually provides any means of 
achieving herbicide resistance based on the function of 
wheat GSTs. 

SUMMARY OF THE INVENTION 

We have purified four GST isoenzymes with activity 
toward herbicides from wheat shoots treated with the 
herbicide safener f enchlorazole-ethyl and have identified 
four distinct subunits. In saf ener-treated shoots, we 
have found that the predominant GST subunit is a 25 kDa 
polypeptide, which has been termed Triticum aestivum GST 
1 (TaGSTl). Additionally, two distinct 26 kDa subunits 



have been identified and termed TaGST2 and TaGST3 and a 
24 kDa subunit, termed TaGST4 . These subunits associate 
together to form the active dimeric isoenzymes TaGSTl-l, 
raGSTl-2, TaGSTl-3 and TaGSTl-4 . 

In our experiments, the expression of all four 
isoenzymes was affected by the herbicide safener 
f enchlorazole-ethyl , although each one responds in a 
somewhat different way. The TaGSTl-l isoenzyme is the 
major GST present in the leaves of untreated wheat 
seedlings, and its expression is increased by 
approximately 50% following exposure to f enchlorazole- 
ethyl. TaGSTl-4 is expressed at low levels in untreated 
shoots and its expression is greatly increased by safener 
application, while TaGSTl-2 and TaGSTl-3 are only 
observed following treatment with the safener. All four 
of these GST isoenzymes have broad-ranging activities 
toward xenobiotic substrates and all four demonstrate 
activity towards herbicides and additional activities as 
glutathione peroxidases able to reduce organic 
hydroperoxides, with TaGSTl-4 being the most active in 
this respect. Each isoenzyme also has specific 
properties- Thus, for example, detoxification of one 
particular herbicide, f enoxaprop-ethyl , is associated 
with the more strongly saf ener-inducible TaGSTl-2, 
TaGSTl-3 and TaGSTl-4 heterodimers , rather than with the 
TaGSTl-1 homodimer . 

Furthermore, we have identified, cloned and 
sequenced cDNAs for the major type 3 GSTs in wheat, 
together with cDNAs encoding a range of type I GSTs, all 
active in herbicide metabolism. This is fundamental to 
understanding the GST detoxification system in wheat and 
to exploiting it to generate transgenic herbicide- 
resistant plants expressing wheat GSTs. In many previous 



studies, GST activity could not be linked to specific 
genes, precluding this approach - 

From the sequences of the cDNAs the amino acid 
sequences of the GST subunits themselves has been 
deduced . 

Accordingly, the invention provides; 

a polynucleotide encoding a glutathione transferase 
(GST) subunit, which polynucleotide comprises a coding 
sequence capable of hybridising selectively to the coding 
sequence of SEQ ID No. 1, 3, 5, 7, 9, 11 or 13 or to the 
complement of one of those sequences. 

The invention also provides: 

a polypeptide which is a GST subunit and comprises 
the amino acid sequence of SEQ ID No. 2, 4, 6, 8, 10, 12 
or 14 or a sequence substantially homologous thereto, or 
a fragment of either said sequence. 

The invention also provides: 

a dimeric protein comprising two GST subunits, 
wherein at least one subunit is a polypeptide of the 
invention . 

The invention also provides: 

a chimeric gene comprising a polynucleotide of the 
invention operably linked to regulatory sequences that 
allow expression of the coding sequence in a host cell- 



The invention also provides: 



a vector comprising a polynucleotide of the 
invention or a chimeric gene of the invention. 

The invention also provides: 

a cell transformed or transfected with a vector of 
the invention. 

The invention also provides: 

a cell having, integrated into its genome, a 
chimeric gene of the invention. 

The invention also provides: 

a process for the production of a polypeptide of 
the invention, which process comprises: 

(a) cultivating a cell of the invention under 
conditions that allow the expression of the polypeptide; 
and 

(b) recovering the expressed polypeptide. 
The invention also provides: 

a process for the production of a dimeric protein 
of the invention, which process comprises: 

(a) cultivating a cell of the. invention under 
conditions that allow: 

(i) the expression of the polypeptide of the 
invention and, if a further polynucleotide sequence as 
defined herein is present, optionally the expression of a 
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20 



25 



30 
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further GST subunit encoded by a further polynucleotide, 
and 

(ii) the association of the GST subunit polypeptide 
of the invention with another GST subunit polypeptide to 
form a dimeric protein of the invention; and 

(b) recovering the dimeric protein so formed. 
The invention also provides: 

a method of obtaining a transgenic plant cell 
comprising : 

(a) transforming a plant cell with an expression vector 
of the invention to give a transgenic plant cell, 

and optionally, 

(a') transforming the cell with one or more further 
polynucleotide sequences coding for a GST subunit, 
operably linked to regulatory elements that allow 
expression of the subunit in the cell. 

The invention also provides: 

a method of obtaining a first-generation transgenic 
plant comprising: 

(b) regenerating a transgenic plant cell transformed 
with a vector of the invention to give a transgenic 
plant . 

The invention also provides: 



35 



a method of obtaining a transgenic plant seed 



comprising : 



(c) obtaining a transgenic seed from a transgenic plant 
obtainable by regenerating a transgenic plant cell 
transformed with a vector of the invention. 

The invention also provides: 

a method of obtaining a transgenic progeny plant 
comprising obtaining a second-generation transgenic 
progeny plant from a first-generation transgenic plant 
obtainable by regenerating a transgenic plant cell 
transformed with a vector of the invention, and 
optionally obtaining transgenic plants of one or more 
further generations from the second-generation progeny 
plant thus obtained. 

The invention also provides: 

a method of obtaining a transgenic progeny plant 
comprising obtaining a second-generation transgenic 
progeny plant from a first-generation transgenic plant 
obtainable by regenerating a transgenic plant cell 
transformed with a vector of the invention comprising: 

(c) obtaining a transgenic seed from a first-generation 
transgenic plant obtainable by regenerating a transgenic 
plant cell transformed with a vector of the invention, 
then obtaining a second-generation transgenic progeny 
plant from the transgenic seed; 

and/or 

(d) propagating clonally a first-generation transgenic 
plant obtainable by regenerating a transgenic plant cell 
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transformed with a vector of the invention to give a 
second-generation progeny plant; 

and/or 

(e) crossing a first-generation transgenic plant 
obtainable by regenerating a transgenic plant cell 
transformed with a vector of the invention with another 
plant to give a second-generation progeny plant; 

and optionally; 

(f) obtaining transgenic progeny plants of one or more 
further generations from the second-generation progeny 
plant thus obtained. 

The invention also provides: 

a transgenic plant cell, first-generation plant, 
plant seed or progeny plant obtainable by a method of the 
invention . 

The invention also provides: 

a transgenic plant or plant seed comprising plant 
cells of the invention. 
The invention also provides: 

a transgenic plant cell callus comprising plant 
cells of the invention, or obtainable from a transgenic 
plant cell, first-generation plant, plant seed or progeny 
plant of the invention. 



The invention also provides: 



use of a polynucleotide of the invention as a 
selectable marker for detecting transformation of a plant 
cell. 

The invention also provides: 

a nucleic acid construct comprising: 

(a) a polynucleotide of the invention operably linked 
to regulatory elements that allow expression of the 
coding sequence in a plant cell; and 

(b) a site into which a further polynucleotide 
comprising a coding sequence can be inserted. 

The invention also provides : 

a vector comprising such a construct. 

The invention also provides : 

a method of transforming a plant cell or of 
obtaining a plant cell culture or transgenic plant 
comprising : 

(a) providing an untransf ormed plant cell which is 
susceptible to a herbicide whose herbicidal activity is 
reduced by a dimeric protein of the invention; 

(b) transforming the plant cell with a vector 
comprising : 

(i) a polynucleotide of the invention operably 
linked to regulatory elements that allow expression of 
the coding sequence in a plant cell; and 
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(ii) a site into which a further polynucleotide 
comprising a coding sequence can be inserted; 

(c) cultivating the transformed cell under conditions 
that allow the expression of the polynucleotide (a) in 
the construct; and/or 

(c') regenerating the cell to give a cell culture or 
plant such that the polynucleotide (a) in the construct 
is expressed; and 

(d) contacting the cell, cell culture or plant with the 
herbicide whose herbicidal activity is reduced by the 
dimeric protein of the invention, and to which the 
untransf ormed plant cell was susceptible; and 

(e) selecting cells, cell cultures or plants that are 
less susceptible to the herbicide than are corresponding 
untransf ormed cells, cell cultures or plants. 

The invention also provides: 

use of a dimeric protein of the invention in a 
method of identifying compounds capable of metabolism by 
a GST. 

The invention also provides: 

a method of identifying compounds capable of being 
metabolised by a glutathione transferase comprising: 

(a) contacting a candidate compound suspected of being 
capable of being metabolised by glutathione transferase 
with glutathione (GSH) in the presence of a dimeric 



4 
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protein of the invention; and 

(b) determining whether or not metabolism of the 
candidate compound takes place. 

5 

The invention also provides: 

compounds identified by such methods. 

10 The invention also provides: 

a kit for detecting compounds capable of being 
metabolised by a GST comprising: 

15 (a) reduced glutathione or homoglutathione ; 

and 

(b) a dimeric protein of the invention. 

20 The invention also provides: 

an antibody which specifically recognises a 
polypeptide or dimeric protein of the invention. 

25 The invention also provides: 

a nucleic acid probe which selectively hybridises 
to the sequence of SEQ ID No. 1, 3, 5, 7, 9, 11 or 13. 

30 The invention also provides: 

a method of identifying compounds that induce GST 
expression in graminaceous plants comprising: 



35 



(a) contacting a graminaceous plant, or a cell or cell 
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culture thereof, with a candidate compound suspected of 
being capable of inducing GST expression; and 

(b) determining the level of GST expression in the 
5 plant, cell or cell culture. 

The invention also provides: 

compounds identified by such methods. 

10 

The invention also provides: 

a kit for identifying compounds that induce 
GST expression in plants by such a method, which kit 
15 comprises an antibody of the invention. 

The invention also provides: 

a method of determining the GST level in a sample 
20 of seed or flour comprising: 

(i) determining the level of GST protein 
present by using an antibody of the invention; 
or 

25 

(ii) determining the level of GST mRNA present 
using a probe of the invention. 

The invention also provides: 

30 

a method of controlling the growth of weeds at a 
locus where a transgenic plant of the invention is being 
cultivated, which method comprises applying to the locus 
a herbicide whose herbicidal properties are reduced by a 
35 dimeric protein of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 . Anion-exchange chromatography of 
affinity-purified wheat GSTs . 

Chromatography of A: affinity-purified polar GSTs; 
and B: affinity-purified hydrophobic GSTs on Hi-Trap Q- 
Sepharose columns eluted with the increasing NaCl 
gradient shown. The eluent was monitored for Ajgo shown 
with the unbroken line and individual fractions assayed 
for GST activity. 

Figure 2 . HPLC analysis of wheat GST subunits . 

Reversed-phase HPLC analysis of polypeptide subunits 
present in A, affinity-purified polar GSTs; B, affinity- 
purified hydrophobic GSTs; C, the isoenzyme TaGSTl-l, 
resolved by anion-exchange chromatography of the 
affinity-purified polar GSTs. 

DETAILED DESCRIPTION OF THE INVENTION 

Polynucleotides 

The invention provides polynucleotides comprising 
sequences encoding novel GST subunits, SEQ ID Nos 1, 3, 
5, 7, 9, 11 and 13 and sequences that hybridise 
selectively to these coding sequences thereof or their 
complementary sequences. It also provides polynucleotide 
fragments of these sequences that encode polypeptides 
having GST activity, as defined herein. 

A polynucleotide of the invention is capable of 
hybridising selectively with the coding sequence of SEQ 
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ID No- 1, 3, 5, 7, 9, 11 or 13 or to the sequence 
complementary to one of those coding sequences. 
Polynucleotides of the invention include variants of the 
coding sequence of SEQ ID No. 1, 3, 5, 1, 9, 11 or 13 
5 which can function as GSTs, when dimerised with another 

GST subunit. Typically, a polynucleotide of the invention 
is a contiguous sequence of nucleotides which is capable 
of selectively hybridising to the coding sequence of SEQ 
ID. No. 1, 3, 5, 1, 9, 11 or 13 or to the complement of 
10 that coding sequence. 

A polynucleotide of the invention can hybridise to 
coding sequence of SEQ ID No. 1, 3, 5, 7, 9, 11 or 13 at 
a level significantly above background. Background 

15 hybridisation may occur, for example, because of other 
cDNAs present in a cDNA library. The signal level 
generated by the interaction between a polynucleotide of 
the invention and the coding sequence of SEQ ID No. 1, 3, 
br 1, 9, 11 or 13 is typically at least 10 fold, 

20 preferably at least 100 fold, as intense as interactions 
between other polynucleotides and the coding sequence of 
SEQ ID No. 1, 3, 5, 7, 9, 11 or 13. The intensity of 
interaction may be measured, for example, by 
radiolabelling the probe, e.g. with ^^P. Selective 

25 hybridisation is typically achieved using conditions of 
medium to high stringency (for example 0.03M sodium 
chloride and 0.03M sodium citrate at from about 50°C to 
about 60°C) . 

30 A nucleotide sequence capable of selectively 

hybridising to the DNA coding sequence of SEQ ID No. 1, 
3, 5, 7, 9, 11 or 13 or to the sequence complementary to 
one of those coding sequences will be generally at least 
70%, preferably at least 80 or 90% and more preferably at 

35 least 95%, 98% or 99%, homologous to the coding sequence 



of SEQ ID No. 1, 3, 5, 7, 9, 11 or 13 or the complement 
of one of those sequences over a region of at least 20, 
preferably at least 30, for instance at least 40, 60 or 
100 or more contiguous nucleotides. 

Any combination of the above mentioned degrees of 
homology and minimum sizes may be used to define 
polynucleotides of the invention, with the more stringent 
combinations (i.e. higher homology over longer lengths) 
being preferred. Thus for example a polynucleotide which 
is at least 90% homologous over 25, preferably over 30 
nucleotides forms one aspect of the invention, as does a 
polynucleotide which is at least 95% homologous over 40 
nucleotides . 

Polynucleotides of the invention may comprise DNA or 
RNA. They may also be polynucleotides which include 
within them synthetic or modified nucleotides- A number 
of different types of modification to polynucleotides are 
known in the art. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or 
polylysine chains at the 3' and/or 5' ends of the 
molecule. For the purposes of the present invention, it 
is to be understood that the polynucleotides described 
herein may be modified by any method available in the 
art. Such modifications may be carried out in order to 
enhance the in vivo activity or lifespan of 
polynucleotides of the invention. 

Polynucleotides of the invention may be used to 
produce a primer, e.g. a PGR primer, a primer for an 
alternative amplification reaction, a probe e.g. labelled 
with a revealing label by conventional means using 
radioactive or non-radioactive labels, or the 
polynucleotides may be cloned into vectors. Such primers. 
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probes and other fragments will preferably be at least 
10, preferably at least 15 or 20, for example at least 
25, 30 or 40 nucleotides in length. 



5 Polynucleotides such as a DNA polynucleotide and 

primers according to the invention may be produced 
recombinantly, synthetically, or by any means available 
to those of skill in the art. They may also be cloned by 
standard techniques. The polynucleotides are typically 
10 provided in isolated and/or purified form- 



In general, primers will be produced by synthetic 
means, involving a stepwise manufacture of the desired 
nucleic acid sequence one nucleotide at a time. 
15 Techniques for accomplishing this using automated 
techniques are readily available in the art. 

Genomic clones corresponding to the cDNAs of SEQ ID 
No. 1, 3, 5, 7, 9, 11 and 13 containing, for example 

20 introns and promoter regions are also aspects of the 
invention and may also be produced using recombinant 
means, for example using PGR (polymerase chain reaction) 
cloning techniques, starting with genomic DNA from a 
wheat (Tritlcum aestlvum L,)^ cell, e.g. a wheat shoot 

25 cell or a cell of a plant of a related Tritlcum species, 
for example as described by Feldman et ai., (Scientific 
American, (1981), vol. 244(1) pages 98 to 109). 



Although in general the techniques mentioned herein 
30 are well known in the art, reference may be made in 

particular to Sambrook et ai, 1989, Molecular Cloning: a 
laboratory manual. 



35 



Polynucleotides which are not 100% homologous to the 
sequences of the present invention but fall within the 
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scope of the invention can be obtained in a number of 
ways . 

Other allelic variants of the wheat sequences of SEQ 
5 ID Nos. 1, 3, 5, 1, 9, 11 and 13 including those from 

Triticum aestivum L. species itself related to Triticum 
aestivum L. {cf Feldman et ai, supra) may be obtained for 
example by probing genomic DNA libraries made from a 
range of wheat cells, using probes as described above. 

10 

In addition, other plant homologues of SEQ ID Nos. 
1, 3, 5, 1, 9, 11 and 13 may be obtained and such 
homologues and fragments thereof in general will be 
capable of selectively hybridising to the coding sequence 

15 of SEQ ID No. 1, 3, 5, 7, 9, 11 or 13 or its complement. 

Such sequences may be obtained by probing cDNA or genomic 
libraries from other plant species with probes as 
described above. Degenerate probes can be prepared by 
means known in the art to take into account the 

20 possibility of degenerate variation between the DNA 

sequences of SEQ ID Nos. 1, 3, 5, 1, 9, 11 and 13 and the 
sequences being probed for under conditions of medium to 
high stringency (for example 0.03M sodium chloride and 
0.03M sodium citrate at from about 50°C to about 60^C) . 

25 

Allelic variants and species homologues may also be 
obtained using degenerate PGR which will use primers 
designed to target sequences within the variants and 
homologues encoding likely conserved amino acid 

30 sequences. Likely conserved sequences can be predicted 
from aligning the amino acid sequences of the invention 
(SEQ ID No. 2, 4, 6, 8, 10, 12 and 14) with that of other 
similar GST subunit sequences. The primers will contain 
one or more degenerate positions and will be used at 

35 stringency conditions lower than those used for cloning 
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sequences with single sequence primers against known 
sequences . 

Alternatively, such polynucleotides may be obtained 
5 by site directed mutagenesis of SEQ ID Nos - 1, 3, 5, 7, 
9, 11 or 13 sequences or allelic variants thereof. This 
may be useful where, for example, silent codon changes 
are required to sequences to optimise codon preferences 
for a particular host cell in which the polynucleotide 
10 sequences are being expressed* Other sequence changes may 
be desired in order to introduce restriction enzyme 
recognition sites, or to alter the property or function 
of the polypeptides encoded by the polynucleotides, 

15 The invention further provides double stranded 

polynucleotides comprising a polynucleotide of the 
invention and its complement. 

Polynucleotides, probes or primers of the invention 
20 may carry a revealing label. Suitable labels include 

radioisotopes such as "^^P or ^^S, enzyme labels, or other 
protein labels such as biotin. Such labels may be added 
to polynucleotides, probes or primers of the invention 
and may be detected using techniques known per se. 

25 

The present invention also provides polynucleotides 
encoding the polypeptides of the invention described 
below. Because such polynucleotides will be useful as 
sequences for recombinant production of polypeptides of 
30 the invention, it is not necessary for them to be 
selectively hybridisable to the coding sequence of 
sequence SEQ ID Nos. 1, 3^ 5, 7, 9, 11 or 13 although 
this will generally be desirable. Otherwise, such 
polynucleotides may be labelled, used, and made as 



described above if desired. Polypeptides of the invention 
are described below. 

Particularly preferred polynucleotides of the 
invention are those of SEQ ID No, 1, 3, 5, 7, 9, 11 and 
13 and the polynucleotides that are the coding regions 
within those sequences i.e. the regions which encode the 
polypeptides of SEQ ID No. 2, 4, 6, 8, 10, 12 or 14. 

Polvoeptides 

A polypeptide of the invention consists essentially 
of the amino acid sequence set out in SEQ ID No. 2, 4, 6, 
8, 10, 12 or 14 or a substantially homologous sequence, 
or of a fragment of either of these sequences. In 
general, the naturally occurring amino acid sequences 
shown in SEQ ID Nos . 2, 4, 6, 8, 10, 12 and 14 are 
preferred. However, the polypeptides of the invention 
include homologues of the natural sequences, and 
fragments of the natural sequences and of their 
homologues, which have GST activity. 

The polypeptides of the invention are glutathione 
transferase (GST) subunits . The invention also provides 
dimeric proteins comprising two GST subunits wherein at 
least one subunit is a polypeptide of the invention. 

Thus, the polypeptides of the invention are normally 
functionally active as GSTs when dimerised with another 
GST subunit. Thus, dimeric proteins of the invention are 
capable of catalysing the conjugation of the tripeptide 
glutathione (GSH, gamma-glutamylcysteinyl glycine) and/or 
related derivatives to an electrophilic substrate of 
natural or synthetic origin. Related derivatives include 
homoglutathione {gamma-glutamylcysteinyl alanine ) and 
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hydroxyglutathione (gamma-glutamylcysteinyl serine) . 

Optionally, they may also have one or more of the 
other properties of naturally occurring GSTs including 
5 glutathione peroxidase activity as defined above. 

Preferably, they have GST activity towards one or 
more herbicide substrates. For example, they may have 
activity towards one or more of the following herbicides: 

10 Fluorodifen, Fenoxaprop-ethyl , Metolachlor , Alpha- 
Metolachlor, Acetochlor, Alachlor, Pretilachlor , 
Fluthiamid, Dimethenamid, S-Dimethenamid, Flupyrsulf uron- 
methyl , Trif lusulf uron-methyl , Acif luorf en, Chlorimuron- 
ethyl, Fomesafen, Atrazine, Simazine, Cyanazine, 

15 Metribuzin. Particularly preferred herbicides include 
Fenoxaprop-ethyl , Flupyrsulf uron-me thy 1 , Fluthiamid, 
Acetochlor, Metolachlor and Alpha-Metolachlor - 

Most preferably, a dimeric protein of the invention 
20 is able to catalyse the conjugation of GSH to one or more 
of the following herbicide substrates: Fenoxaprop-ethyl, 
Flupyrsulf uron-me thy 1 , f luthiamid, Acetochlor, 
Metolachlor and Alpha-Metolachlor. 

25 Optionally, a dimeric protein of the invention may 

be able to catalyse the conjugation of GSH to one or more 
non-herbicide substrates, for example CDNB. They may 
also have activity towards phytotoxic non-herbicide 
substrates . 



30 



Optionally, monomeric polypeptides of the invention 
may have GST activity as defined above, even when not 
dimerised - 
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In particular, a polypeptide of the invention may 



comprise : 
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(a) the polypeptide sequence of SEQ ID No. 2, 4, 6, 
8, 10, 12 or 14; 

(b) an allelic variant or species homologue 
thereof; or 

(c) a protein at least 70 80, 90, 95, 98 or 99% 
homologous to (a) or (b) . 

An allelic variant will be a variant which will 
occur naturally in a plant and which will function in a 
substantially similar manner to the protein of SEQ ID No, 
2, 4, 6, 8, 10, 12 or 14, as defined above. Similarly, a 
species homologue of the protein will be the equivalent 
protein which occurs naturally in another plant species 
which can function as GST. Such a homologue may occur in 
plants other than wheat, particularly monocotyledonous 
plants such as related Triticum species, rice, maize, 
oats, rye, barley, triticale or sorghum. Within any one 
species, a homologue may exist as several allelic 
variants, and these will all be considered homologues of 
the protein of SEQ ID No. 2, 4, 6, 8, 10, 12 or 14. 

Allelic variants and species homologues can be 
obtained by following the procedures described herein for 
the production of the polypeptides of SEQ ID No. 2, 4, 6, 
8, 10, 2 and 14 and performing such procedures on a 
suitable cell source e.g. a cell of a wheat genotype 
carrying an allelic variant, or a cell of a plant of a 
different another species. It will also be possible to 
use a probe as defined above nucleotide sequence to probe 
libraries made from plant cells in order to obtain clones 
encoding the allelic or species variants. The clones can 
be manipulated by conventional techniques to generate a 
polypeptide of the invention which can then be produced 
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by recombinant or synthetic techniques known per se. 

A polypeptide of the invention is preferably at 
least 70% homologous to the protein of SEQ ID No. 2, 4, 
5 6, 8, 10, 2 or 14, more preferably at least 80 or 90% and 
more preferably still at least 95%, 97% or 99% homologous 
thereto over a region of at least 20, preferably at least 
30, for instance at least 40, 60 or 100 or more 
contiguous amino acids. Methods of measuring protein 
10 homology are well known in the art and it will be 

understood by those of skill in the art that in the 
present context, homology is calculated on the basis of 
amino acid identity (sometimes referred to as "hard 
homology" ) . 



15 



20 



The sequence of the polypeptides of SEQ ID Nos 2, 4, 
6, 8, 10, 12 and 14 and of allelic variants and species 
homologues can thus be modified to provide polypeptides 
of the invention. 



Amino acid substitutions may be made, for example 
from 1, 2 or 3 to 10, 20 or 30 substitutions. The 
modified polypeptide generally retains activity as a GST, 
as defined herein- Conservative substitutions may be 
25 made, for example according to the following Table. Amino 
acids in the same block in the second column and 
preferably in the same line in the third column may be 
substituted for each other. 
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5 Polypeptides of the invention also include fragments 

of the above-mentioned full length polypeptides and 
variants thereof, including fragments of the sequence set 
out in SEQ ID No, 2, 4, 6, 8, 10, 12 and 14. Such 
fragments typically retain activity as a GST. 

10 

Other preferred fragments include those which 
include an epitope. Suitable fragments will be at least 
about 5, e.g. 10, 12, 15 or 20 amino acids in size. 
Polypeptide fragments of the polypeptides of SEQ ID Nos . 

15 2, 4, 6, 8, 10, 12 and 14, and allelic and species 

variants thereof may contain one or more (e.g. 2, 3, 5, 
or 10) substitutions, deletions or insertions, including 
conserved substitutions. Epitopes may be determined 
either by techniques such as peptide scanning techniques 

20 already known in the art. These fragments will be useful 
for obtaining antibodies to polypeptides and dimeric 
proteins of the invention. 

Polypeptides of the invention may be in a 
25 substantially isolated form. It will be understood that 
the polypeptide may be mixed with carriers or diluents 
which will not interfere with the intended purpose of the 
polypeptide and still be regarded as substantially 
isolated. A polypeptide of the invention may also be in a 
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substantially purified form, in which case it will 
generally comprise the polypeptide in a preparation in 
which more than 90%, e.g. 95%, 98% or 99% of the 
polypeptide in the preparation is a polypeptide of the 
5 invention. 

Polypeptides of the invention may be modified for 
example by the addition of Histidine residues or a T7 tag 
to assist their identification or purification or by the 
10 addition of a signal sequence to promote their secretion 
from a cell - 

A polypeptide of the invention may be labelled with 
a revealing label. The revealing label may be any 
15 suitable label which allows the polypeptide to be 

detected. Suitable labels include radioisotopes, e.g. 
^^^I, ^^S, enzymes, antibodies, polynucleotides and linkers 
such as biotin. 

20 Polypeptides and dimeric proteins of the invention 

may be chemically modified, e.g. post-translationally 
modified. For example, they may be glycosylated or 
comprise modified amino acid residues. Such modified 
polypeptides and proteins fall within the scope of the 

25 terms "polypeptide" and "dimeric protein" of the 
invention . 

Dimeric proteins 

30 The invention also provides dimeric proteins having 

two GST subunits wherein at least one of the two subunits 
is a polypeptide of the invention. These dimeric proteins 
may have two identical subunits of the invention, i.e. 
they may be homodimeric. Alternatively, they may have two 

35 dissimilar subunits; i.e. they may be heterodimeric . 



In heterodimers, the two subunits may both be 
polypeptides of the invention. Alternatively, one subunit 
may be a polypeptide of the invention, whilst the other 
is a different GST subunit. 



Thus, for example, heterodimeric proteins of the 
invention may have one subunit which is a polypeptide of 
the invention, and one which is a known GST subunit from 
maize (e.g. ZmGSTI, ZmGSTII, ZmGSTIII, ZmGSTIV, ZmGSTV or 
ZmGSTVI : see above), or another species. 

Preferably, the dimeric proteins have two subunits 
that are polypeptides of the invention. Various 
combinations of polypeptides of the invention are 
possible. Preferred combinations include: 



TaGSTl-l (SEQ ID 

raGSTl-2 (SEQ ID 

TaGSTl-3 (SEQ ID 

raGSTl-4 (SEQ ID 



No. 2/SEQ ID No. 2) ; 

No. 2/SEQ ID No. 4) ; 

No. 2/SEQ ID No. 6) ; 

No. 2/SEQ ID No. 8) ; 



being representative of the major combinations found 
in GSTs in wheat. 



The invention also provides dimeric proteins having 
two subunits as described above which are fusion 
proteins. In these fusion proteins, the two subunits are 
joined by a linker polypeptide. Any linker may be used as 
long as it does not interfere significantly with the 
correct association of the two subunits or with the GST 
activity of the dimer . Such fusion proteins will 
typically be prepared by joining together the 
polynucleotides encoding the two monomers in the correct 
reading frame, then expressing the composite 



polynucleotide coding sequence under the control of 
regulatory sequences as defined herein. These composite 
polynucleotide coding sequences are a further aspect of 
the invention, as are chimeric genes and vectors 
comprising them, methods of producing them by recombinant 
means, and ceils and plants comprising such vectors or 
chimeric genes. It will be understood that dimeric 
proteins of the invention may be such fusion proteins. 



Vectors and chimeric genes 



Polynucleotides of the invention can be incorporated 
into a recombinant replicable vector. The vector may be 
used to replicate the nucleic acid in a compatible host 
cell- Thus in a further embodiment, the invention 
provides a method of making polynucleotides of the 
invention by introducing a polynucleotide of the 
invention into a replicable vector, introducing the 
vector into a compatible host cell, and cultivating the 
host cell under conditions which bring about replication 
of the vector. The vector may be recovered from the host 
cell. Suitable host cells are described below in 
connection with expression vectors. Bacterial cells, 
especially E. Coli are preferred. 

Expression vectors 



Preferably, a polynucleotide of the invention in a 
vector is operably linked to regulatory sequences capable 
of effecting the expression of the coding sequence by the 
host cell, i.e. the vector is an expression vector. Such 
expression vectors can be used to express the 
polypeptides of the invention. 



The term "operably linked" refers to a juxtaposition 
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wherein the components described are in a relationship 
permitting them to function in their intended manner. A 
regulatory sequence "operably linked" to a coding 
sequence is positioned in such a way that expression of 
5 the coding sequence is achieved under conditions 
compatible with the regulatory sequences. 

Such vectors may be introduced into a suitable host 
cell to provide for expression of a polypeptide or 
10 polypeptide fragment of the invention, as described 
below . 

The vectors may be for example, plasmid, virus or 
phage vectors provided with an origin of replication, 

15 preferably a promoter for the expression of the said 
polynucleotide and optionally an enhancer and/or a 
regulator of the promoter. For expression in plant cells, 
one preferred enhancer is the Tobacco etch virus (TEV) 
enhancer. A terminator sequence may also be present, as 

20 may a polyadenylation sequence. The vectors may contain 
one or more selectable marker genes, for example an 
ampicillin resistance gene in the case of a bacterial 
plasmid or a neomycin resistance gene (e.g. npti or 
nptll) or methotrexate resistance gene for a plant 

25 vector. Vectors may be used in vitro, for example for the 
production of RNA or used to transfect or transform a 
host cell. The vector may also be adapted to be used in 
vivo, for example for generation of transgenic plants of 
the invention. 

30 

So far as plasmid vectors are concerned, plasmids 
derived from the Ti plasmid of Agrobacterium tumefaciens 
are especially preferred, as are plasmids derived from 
the Ri plasmid of Agrobacterium rhizogenes , 
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A further embodiment of the invention provides host 
cells transformed or transfected with the vectors for the 
replication and expression of polynucleotides of the 
5 invention. The cells will be chosen to be compatible with 
the said vector and may for example be prokaryotic 
(bacterial), plant, yeast, insect or mammalian cells, 
bacterial and plant cells being preferred. 

10 Polynucleotides according to the invention may also 

be inserted into the vectors described above in an 
antisense orientation in order to provide for the 
production of antisense RNA . Antisense RNA or other 
antisense polynucleotides may also be produced by 

15 synthetic means. Such antisense polynucleotides may be 

used in a method of controlling the levels of GSTs having 
the sequence of SEQ ID No. 2, 4, 6, 8, 10, 12 or 14 or 
their variants or species homologues in planta. 

20 Promoters and other regulatory elements may be 

selected to be compatible with the host cell for which 
the expression vector is designed. 

Promoters suitable for use in plant cells may be 
25 derived, for example, from plants or from bacteria that 
associate with plants or from plant viruses, thus, 
promoters from Agrobacterium spp. including the nopaline 
synthase (nos) , octopine synthase (ocs) and mannopine 
synthase (mas) promoters are preferred. Also preferred 
30 are plant promoters such as the ribulose bisphosphate 

small subunit promoter (rubisco ssu) , and the phaseolin. 
promoter. Also preferred are plant viral promoters such 
as the cauliflower mosaic virus (CAMV) 35S and 19S 
promoters - 
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Depending on the pattern of expression desired, 
promoters may be constitutive or inducible. For example, 
strong constitutive expression in plants can be obtained 
with the CAMV 35S or rubisco ssu promoters. Also, tissue- 
specific or stage-specific promoters may be used to 
target expression of polypeptides of the invention to 
particular tissues in a transgenic plant or to particular 
stages in its development- Chemically inducible promoters 
such as those activated by herbicide safeners may also be 
used, for example the maize GST 27 promoter (W097/11189) , 
the maize In2-1 promoter (WO90/11361 ) , the maize In2-2 
promoter (De Veylder et ai, Plant cell Physiology, Vol. 
38, pp568-577 (1997) , 

Especially where expression in plant cells is 
desired, other regulatory signals may also be 
incorporated in the vector, for example a terminator 
and/or polyadenylation site. One preferred terminator is 
the nos terminator although other terminators functional 
is the nos terminator in plant cells may also be used. 

Additionally, sequences encoding secretory signals 
or transit peptides may be included. On expression, these 
elements direct secretion from the cell or target the 
polypeptide of the invention to a particular location 
within the cell. For example, sequences may be added to 
target the expressed polypeptide to the nucleus or 
plastids (e.g. chloroplasts ) of a plant cell. 

Chimeric genes 

The invention also provides chimeric genes suitable 
for securing the expression of polypeptides of the 
invention in a host cell, preferably a plant cell. These 
comprise a polynucleotide of the invention, operably 
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linked to regulatory sequences that allow its expression 
in a host cell, preferably a plant cell. 

Typically, therefore, a chimeric gene comprises the 
5 following elements in 5 * to 3 ' orientation: a promoter 

functional in a host (preferably plant) cell, as defined 
above, a polynucleotide of the invention and a terminator 
functional in said cell, as defined above. Other 
elements, for example an enhancer, may also be present. 
10 These chimeric genes may be incorporated into vectors, as 
defined above. 

Expression in host cells 

15 Expression vectors of the invention may be 

introduced into host cells using conventional techniques 
including calcium phosphate precipitation, DEAE-dextran 
transf ection, or electroporation . For plant cells, 
preferred transformation techniques include 

20 electroporation of plant protoplasts, transformation by 
Agrobacterium tumefaciens and particle bombardment. 
Particle bombardment is particularly preferred for 
transformation of monocot cells. 

25 Expression in the host cell may be transient 

although, preferably, integration of the polynucleotide 
or chimeric gene of the invention into the cell's genome 
is achieved. 

30 Suitable cells include cells in which the above- 

mentioned vectors may be expressed- These include 
microbial cells such as bacteria such as E. coli, plant 
cells, mammalian cells such as CHO cells, C0S7 cells or 
Hela cells, insect cells or yeast such as Saccharomyces , 



Bacterial and plant cells are preferred. 

Optionally, cells of the invention may comprise one 
or more further polynucleotide sequences encoding a GST 
subunit, operably linked to regulatory sequences, as 
defined above, that allow expression of the subunit in 
the cell. Such polynucleotide sequences may be further 
polynucleotides of the invention or they may encode other 
GST subunits as defined above with respect to dimeric 
proteins . 

Such polynucleotides may be naturally present in the 
cell, e.g. if it is a plant cell or they may be 
introduced artificially, e.g. as defined above. 

Such cells allow the production of heterodimeric 
proteins of the invention where the polynucleotides 
encode different GST subunits, or the production of 
monomeric polypeptides of the invention and/or 
homodimeric proteins of the invention in greater 
quantities. For example, they may allow the expression 
of active heterodimeric enzymes. 

Cell culture will take place under standard 
conditions. Commercially available cultural media for 
cell culture are widely available and can be used in 
accordance with manufacturers ' instructions . 

Processes for production of polypeptides and dimeric 
proteins 



The invention provides 
of polypeptides and dimeric 
recombinant means. 



processes for the production 
proteins of the invention by 
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Generally, monomeric GST subunits of the invention 
spontaneously dimerise to form homodimers and/or 
heterodimers of the invention. Thus, in general, 
expression of polypeptides of the invention gives rise to 
5 dimers in the first instance- These dimers may be the 
desired product; alternatively, it may be desirable to 
separate the monomers. For example, as described below, 
it may be desired to separate the monomeric subunits of a 
homodimer in order to combine them with different 
10 monomeric subunits, thereby yielding heterodimers. 

Processes for the production of polypeptides of the 
invention may comprise: 

15 (a) cultivating a transformed cell as defined 

above under conditions that allow the 
expression of the polypeptide; 



20 



and preferably 

(b) recovering the expressed polypeptide. 



For example, the expressed monomeric peptides may be 
recovered by denaturation of dimers formed by them, which 
25 separates the subunits. Then, the monomers can be 
recovered and renatured. Typically, they will then 
redimerise , 

Processes for production of dimeric proteins of the 
30 invention may comprise: 

(a) cultivating a transformed cell as 

defined above under conditions that 
allow 
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(i) the expression of the polypeptide of 
the invention and, if a further GST 
subunit-encoding sequence as defined 
above is present, optionally the 
expression of a further GST subunit 
encoded by the further sequence 



and preferably 



(ii) the association of the GST 

subunit polypeptide of the invention 
with another identical GST subunit 
polypeptide to form a home dimeric 
protein of the invention; and/or 



(ii) the association of the GST subunit 
polypeptide of the invention with a 
non-identical GST subunit to form a 
heterodimeric protein of the 
invention , 

and preferably 

(b) recovering the dimeric proteins so 
formed, and optionally resolving 
them. 

Where only a single type of GST subunit-encoding 
sequence of the invention is present in the transformed 
cell, these processes normally give rise to homodimeric 
proteins of the invention. Where one or more further GST 
subunit-encoding sequences is present, these processes 
give rise to heterodimers or to a mixture of some or all 
of the following: homodimers of each possible type. 
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Alternati vely , dimeric proteins of the invention can 
be produced by expressing the required polypeptide 
subunits in separate cells. This typically leads to the 
5 production of two different types of homodimer. The 

desired heterodimer can then be prepared by: mixing the 
homodimers and denaturing the mixed sample, or by 
denaturing the homodimers separately and then mixing 
them; then renaturing the mixed sample. This will 

10 typically lead to a mixture of dimeric proteins 

comprising both possible types of homodimers and also 
heterodimers comprising one subunit of each type. 
Similarly, mixtures of greater numbers of types of dimer 
can be produced in this way if different homodimers are 

15 produced in three or more different cells, or if cells 
that give rise to heterodimers are used. 

For these processes, any transformed cell as 
described above may be used. Bacterial cells are 
20 preferred, especially cells of E. coll, although other 
cell types may also be used. 

Optionally, the polypeptide or dimeric protein may 
be isolated and/or purified, by techniques known in the 
25 art. 

In processes of the invention, any suitable method 
may be used to denature and/or renature polypeptides of 
the invention, and suitable methods are well known in the 
30 art. 

Similarly, where a mixture of polypeptide subunits 
or dimeric proteins results, these may be resolved or 
separated by any suitable technique known in the art. 

35 
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Antibodies 

The invention also provides monoclonal or polyclonal 
antibodies which specifically recognise polypeptides of 
5 the invention or dimeric proteins of the invention . 

Thus, antibodies of the invention bind specifically 
to the polypeptides and/or dimers of the invention, 
preferably to the extent that they distinguish between 
10 the polypeptides and/or dimers of the invention and other 
GST subunits and GSTs. 

Monoclonal antibodies may be prepared by 
conventional hybridoma technology using polypeptides or 

15 dimeric proteins of the invention as immunogens , 
Polyclonal antibodies may also be prepared by 
conventional means which comprise inoculating a host 
animal, for example a rat or a rabbit, with a polypeptide 
of the invention and recovering immune serum. In order 

20 that such antibodies may be made, polypeptides may be 

haptenised to another polypeptide for use as immunogens 
in animals or humans- For the purposes of this invention, 
the term "antibody" includes antibody fragments such as 
Fv, F(ab) and F(ab)2 fragments, as well as single chain 

25 antibodies. 

Methods of producing transgenic plant cells, plant parts 
and tissues, plants and seeds of the invention 

30 Transgenic plant cells, plant parts and tissues, 

plants and seeds of the invention are transgenic in the 
sense that the y have at least one polynucleotide of the 
invention introduced into them. 
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The invention provides a method of obtaining a 
transgenic plant cell comprising transforming a plant 
cell with an expression vector of the invention to give a 
transgenic plant cell; and optionally transforming the 
5 cell with one or more further polynucleotide sequences 
coding for a GST subunit, operably linked to regulatory 
elements that allow expression of the subunit in the 
cell. (As discussed above, this allows the production of 
heterodimeric GST dimers of the invention, or the 
10 production of homodimeric ones of the invention in 
greater quantities.) 

Any suitable transformation method may be used, for 
example the transformation techniques described herein. 
15 Preferred transformation techniques include 

electroporation of plant protoplasts, transformation by 
Agrobacterium tumefaclens and particle bombardment. 
Particle bombardment is particularly preferred for 
transformation of monocot cells. 

20 

The cell may be in any form, for example, it may be 
an isolated cell, e.g. a protoplast, or it may be part of 
a plant tissue, e.g. a callus, or a tissue excised from a 
plant, or it may be part of a whole plant. Transformation 
25 may thus give rise to a chimeric tissue or plant in which 
some cells are transgenic and some are not. 

Preferably, integration of a polynucleotide or 
chimeric gene of the invention into the cell's genome is 
30 achieved. 

The thus obtained cell may be regenerated into a 
transgenic plant by techniques known in the art. These 
may involve the use of plant growth substances such as 
35 auxins, giberellins and/or cytokinins to stimulate the 

growth and/or division of the transgenic cell. Similarly, 



0 . . # 



-40- 

techniques such as somatic embryogenesis and lueristem 
culture may be used . 

In many such techniques, one step is the formation 
of a callus , i.e. a plant tissue comprising expanding 
and/or dividing cells. Such calli are a further aspect of 
the invention as are other types of plant cell cultures 
and plant parts. Thus, for example, the invention 
provides transgenic plant tissues and parts, including 
embryos, meristems, seeds, shoots, roots, stems, leaves 
and flower parts. These may be chimeric in the sense that 
some of their cells are transgenic and some are not . 

Regeneration procedures will typically involve the 
selection of transformed cells by means of marker genes. 
Some marker genes have already been mentioned and it 
should also be noted that the polynucleotides of the 
invention can themselves act as marker genes if they are 
under the control of regulatory sequences that allow 
their expression during the appropriate stage of the 
regeneration procedure. The polypeptides of the invention 
are capable of conferring resistance to herbicides or 
other phytotoxic compounds which are detoxified by GSTs 
on cells of the invention, as described below. Thus, an 
appropriate herbicide can be used to select 
transf ormants . 

The regeneration step gives rise to a first 
generation transgenic plant. The invention also provides 
methods of obtaining transgenic plants of further 
generations this first generation plant . These are known 
as progeny transgenic plants . progeny plants , of second, 
third fourth, fifth, sixth and further generations may 
be obtained from the first generation transgenic plant 
by any means known in the art. 
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Thus, the invention provides a method of obtaining 
a transgenic progeny plant comprising obtaining a second- 
generation transgenic progeny plant from a first- 
5 generation transgenic plant of the invention, and 

optionally obtaining transgenic plants of one or more 
further generations from the second-generation progeny 
plant thus obtained. 

10 Such progeny plants are desirable because the first 

generation plant may not have all the characteristics 
required for cultivation. For example, for the production 
of first generation transgenic plants, a plant of a taxon 
that is easy to transform and regenerate may be chosen. 

15 It may therefore be necessary to introduce further 

characteristics in one or more subsequent generations of 
progeny plants before a transgenic plant more suitable 
for cultivation is produced, 

20 Progeny plants may be produced form their 

predecessors of earlier generations by any known 
technique. In particular, progeny plants may be produced 
by : 

25 obtaining a transgenic seed from a transgenic plant of 
the invention belonging to a previous generation, then 
obtaining a transgenic progeny plant of the invention 
belonging to a new generation by growing up the 
transgenic seed; 

30 

and/or 

propagating clonally a transgenic plant of the invention 
belonging to a previous generation to give a transgenic 
35 progeny plant of the invention belonging to a new 
generation; 



and/or 



crossing a first-generation a transgenic plant of the 
invention belonging to a previous generation with another 
compatible plant to give a transgenic progeny plant of 
the invention belonging to a new generation; 



and optionally; 



obtaining transgenic progeny plants of one or more 
further generations from the progeny plant thus obtained. 

These techniques may be used in any combination, 
for example, clonal propagation and sexual propagation 
may be used at different points in a process that gives 
rise to a transgenic plant suitable for cultivation. In 
particular, repetitive back-crossing with a plant taxon 
with agronomically desirable characteristics may be 
undertaken. Further steps of removing cells from a plant 
and regenerating new plants therefrom may also b e 
carried out - 

Also, further desirable characteristics may be 
introduced by transforming the cells, plant tissues, 
plants or seeds, at any suitable stage in the above 
process, to introduce desirable coding sequences other 
than the polynucleotides of the invention, this may be 
carried out by the techniques described herein for the 
introduction of polynucleotides of the invention . 



For example, further transgenes may be selected 
from those coding for other herbicide resistance traits; 
e.g. tolerance to Glyphosate (e.g. using an EPSP synthase 
gene (e.g. EP-A-0 293,358) or a glyphosate oxidoreductase 
(WO 92/000377) gene); or tolerance to fosametin; a 
dihalobenzonitrile; glufosinate (e.g. using a 
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phosphinot ricyine acetyl transferase or glutamine 
synthase gene (cf. EP-A-0 242,236); asulam (e.g. using a 
dihydropteroate synthase gene (EP-A-O 369,367); or a 
sulphonylurea (e.g. using an ALS gene); diphenyl ethers 
5 such as acifluorfen or oxyfluorfen (e.g. using a 

protoporphyrogen oxidase gene); an oxadiazole such as 
oxadiazon; a cyclic imide such as chlorophthalim; a 
phenyl pyrrazole such as TNP, or a phenopylate or 
carbamate analogue thereof. 

10 

Similarly, genes for beneficial properties other 
than herbicide tolerance may be introduced. For example, 
genes for insect resistance may be introduced, notably 
genes encoding Bacillus thuringiensis (Bt) toxins. 

15 

Transgenic plant cells, plant parts and tissues, plants 
and seeds of the invention 

20 The invention also provides transgenic plant cells, 

plant parts and tissues, plants and seeds, these are 
typically obtainable, or obtained, by the methods 
described above. They may be of any botanical taxon, e.g. 
any species or lower taxonomic grouping. Preferably, they 

25 are of a crop pant species. 

Transgenic plant cells, plant parts and tissues, 
plants and seeds of the invention may thus be of a 
monocotyledonous (monocot) or dicotyledonous (dicot) 

30 taxon. Preferred dicot crop plants include tomato; 

potato; sugarbeet; cruciferous crops, including oilseed 
rape; linseed; tobacco; sunflower; fibre crops such as 
cotton; and leguminous crops such as peas, beans, 
especially soybean, and alfalfa. Preferred monocots 

35 include graminaceous plants such as wheat, maize, rice, 
oats, barley and rye, sorghum, triticale and sugar cane. 



Wheat is particularly preferred. 



Typically, a polypeptide of the invention is 
expressed in a plant of the invention, depending on the 
promoter used, this expression may be constitutive or 
inducible, e.g. by a herbicide safener. similarly, it may 
be tissue- or stage-specific, i.e. directed towards a 
particular plant tissue or stage in plant development. 



Preferably, plant cells, plant parts and tissues, 
plants and seeds of the invention exhibit herbicide 
resistance due, at least in part, to expression of a 
polypeptide of the invention. 



Herbicides to which plants of the invention may be 
resistant include Fluorodifen, Fenoxaprop-ethyl , 
Metolachlor , Alpha-Metolachlor , Acetochlor , Alachlor , 
Pretilachlor , Fluthiamid, Dimethenamid, S-Dimethenamid, 
Flupyr sulf uron-methyl , Trif lusulf uron-methyl , 
Acif luorf en, Chlorimuron-ethyl , Fomesaf en, Atrazine , 
Simazine, Cyanazine, and Metribuzin. Particularly 
preferred herbicides include Fenoxaprop-ethyl, 
Flupyrsulf uron-methyl , Fluthiamid, Acetochlor , 
Metolachlor and Alpha-Metolachlor. Plants of the 
invention may also exhibit resistance to other herbicides 
capable of conjugation to GSH by GSTs or to other non- 
herbicide phytotoxic substances. 



Preferably, a transgenic plant of the invention 
exhibits resistance to one or more of Fenoxaprop-ethyl, 
Flupyrsulf uron-methyl , Fluthiamid, Acetochlor, 
Metalochlor and Alpha-Metolachlor. Resistance may be 
exhibited to herbicides which are selective for 
particular plant taxa and/or herbicides which are generic 
to all plants. 
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Uses of the polynucleotides, polypeptides, antibodies, 
probes and plants of the invention 

Apart from enabling the generation of herbicide- 
resistant plants, the invention has a number of other 
uses . 

Selectable markers 

Polynucleotides of the invention can be used as 
selectable markers for detecting the transformation of 
plant cells. When expressed from polynucleotides of the 
invention, the polypeptides of the invention are capable 
of conferring herbicide resistance on cells of the 
invention, as described herein. Thus, an appropriate 
herbicide can be used to select transf ormants . 

Accordingly, the invention provides a nucleic acid 
construct comprising : 

(a) a polynucleotide of the invention operably linked 
to regulatory elements that allow expression of a 
polynucleotide of the invention a plant cell; and 

(b) a site into which a further polynucleotide 
comprising a coding sequence can be inserted - 

Preferably, site (b) is bounded by regulatory 
elements that allow expression of a coding sequence 
inserted at the site in a plant cell. 

These constructs may be contained within vectors as 
described herein. 

In these constructs, site (b) is a site into which 



another nucleic acid sequence can be inserted, in cells 
transformed with the constructs or vectors containing 
them, expression of the polypeptide of the invention can 
be used as a selectable marker, indicating that the 
polynucleotide at site (b) has also been successfully 
introduced . 

In this connection, the invention also provides a 
method of transforming a plant cell or of obtaining a 
plant cell culture or transgenic plant comprising: 

(a) providing an untransf ormed plant cell which is 
susceptible to a herbicide whose herbicidal activity is 
reduced by a dimeric protein of the invention; 

(b) transforming the plant cell with a vector 
comprising a marker construct of the invention; 

(c) cultivating the transformed cell under 
conditions that allow the expression of a polypeptide of 
the invention; 

and /or 

(c ') regenerating the cell to give a cell culture or 
plant such that a polypeptide of the invention is 
expressed; 



and 



(d) contacting the cell, cell culture or plant with 
the herbicide whose herbicidal activity is reduced by a 
dimeric protein of the invention, and to which herbicide 
the untransf ormed plant cell was susceptible; 



and 
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(e) selecting cells, cell cultures or plants that 
are less susceptible to the herbicide than are 
corresponding untransf ormed cells, cell cultures or 
plants . 

Identification of novel herbicides 

The polypeptides and dimeric proteins of the 
invention may be used to identify compounds capable of 
conjugation to GSH. Thus, as conjugation to GSH is the 
mechanism by which GSTs are believed to effect 
detoxification of herbicides, the polypeptides of the 
invention can be used to determine whether or not a 
candidate herbicidal compound will be detoxified by GSTs, 
for example the dimeric proteins of the invention. In 
this case, it may be possible to develop the candidate 
compound as a herbicide. In particular, it may be 
possible to develop the candidate compound for selective 
use as a herbicide on crops of wheat, or of a wheat- 
related species, or of other plants (cf Feldman et al 
supra), having GSTs with similar activity to the dimeric 
proteins of the invention. This is because species 
having such GSTs can be expected to detoxify herbicides 
identified in the assay. 

Accordingly, the invention provides a method of 
identifying compounds capable of conjugation to 
glutathione comprising: 

(a) contacting a candidate compound suspected of being 
capable of being metabolised by glutathione transferase 
with glutathione (GSH) in the presence of a dimeric 
protein of the invention; and 



(b) 



determining whether or not metabolism of the 
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candidate compound takes place, or to what extent takes 
place . 

Preferably, metabolism of the compound is detected 
5 by determining whether, or to what extent, conjugation of 
the candidate compound to GSH takes place. 

Such assay methods may be carried out by any 
suitable means known in the art. Compounds may be assayed 
10 singly, or, preferably, in batches containing several 
compounds. For example, microtitre plate-based assay 
techniques may be used. More specifically, the techniques 
of Example 3 below may be used. 

15 The invention also provides compounds identified by 

the methods of the invention . 

The invention also provides a kit for detecting 
compounds capable of being metabolised by a GST 
20 comprising: 

(a) reduced glutathione or homoglutathione ; and 

(b) a dimeric protein of the invention. 

25 Such kits may also comprise other components, 

especially buffer solutions, e.g. aqueous solutions 
buffered at a suitable pH (e.g. pH7 to pHlO, preferably 
pH7 to pH8) . 

30 These kits can be used in the identification of 

novel herbicides. 

Identification of compounds that induce GST 
expression 

35 

We have found that expression of the GSTs of the 
invention is inducible by herbicide safeners. As GSTs are 
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implicated in herbicide resistance, it may be desirable 
to identify other compounds capable of inducing their 
expression or that of related GSTs in wheat or other 
plants, preferably graminaceous plants. Such compounds 
5 may, for example, be used to induce expression of GSTs 

involved in herbicide tolerance. This will be beneficial 
as it will allow crop plants to be selectively protected 
from herbicides whilst weeds are killed by them. 

10 Accordingly, the invention provides a method of 

identifying compounds that induce GST expression in 
graminaceous plants comprising: 

(a) contacting a plant, preferably a graminaceous 
15 plant, or a cell or cell culture thereof, with a 

candidate compound suspected of being capable of inducing 
GST expression; and 

(b) determining the level of GST expression in the 
20 plant, cell or cell culture. 

Typically, the level of expression is also 
determined before the compound is added, or in an 
untreated sample, in order to provide a control. If the 
25 level of GST expression in the test sample is higher than 
that in the control sample then the candidate compound is 
an inducer of GST expression. 

Preferably, the level of GST expression is 
30 determined quantitatively although, in certain 

situations, quantitative detection may suffice, e.g. 
where the level of expression is zero or undetectable in 
the absence of an inducer. 

35 Determination of the level of GST expression may be 

performed by any suitable means. Preferably, it is 
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performed using antibodies or probes of the invention, as 
described herein. 



The invention also provides compounds identified by 
these methods. 



Antibodies that specifically recognise the 
polypeptides or dimeric proteins of the invention can be 
used to detect and preferably quantify GST expression by 
detecting them directly. The antibodies of the invention 
may thus be used for detecting polypeptides or dimeric 
proteins of the invention present in plant samples, e.g. 
by a method which comprises: 



(a) providing an antibody of the invention; 

(b) incubating a plant sample with said antibody 
under conditions which allow for the formation 
of an antibody-antigen complex; and 

(c) determining, by any suitable technique known in 
the art, whether antibody-antigen complex 
comprising said antibody is formed. 



Antibodies of the invention may be bound to a solid 
support and/or packaged into kits in a suitable container 
along with suitable reagents, controls, instructions and 
the like. 



Similarly, polynucleotides or primers of the 
invention or fragments thereof, labelled or unlabelled, 
may be used by a person skilled in the art in nucleic 
acid-based tests for detecting nucleic acid sequences of 
the invention in a sample taken from a plant, typically a 
wheat plant. 
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Such tests generally comprise bringing a sample 
containing DNA or RNA into contact with a probe 
comprising a polynucleotide or primer of the invention 
under hybridising conditions and detecting any duplex 
5 formed between the probe and nucleic acid in the sample. 
Such detection may be achieved using techniques such as 
PGR or by immobilising the probe on a solid support, 
removing nucleic acid in the sample which is not 
hybridised to the probe, and then detecting nucleic acid 
10 which has hybridised to the probe. 

Alternatively,- the sample nucleic acid may be 
immobilised on a solid support, and the amount of probe 
bound to such a support can be detected. 

15 

The probes of the invention may conveniently be 
packaged in the form of a test kit in a suitable 
container- In such kits the probe may be bound to a solid 
support where the assay format for which the kit is 
20 designed requires such binding. The kit may also contain 
suitable reagents for treating the sample to be probed, 
hybridising the probe to nucleic acid in the sample, 
control reagents, instructions, and the like. 

25 Measuring the level of GST in batches o f seed or 

flour 

Owing to the secondary activity of the GSTs of the 
invention as glutathione peroxidases, the polypeptides 

30 and dimeric proteins of the invention will also have 
applications in determining the quality of batches of 
seed and flour, especially of wheat seed, grain and wheat 
flour. In such batches, glutathione peroxidases are 
involved in reducing lipid hydroperoxides, which reduces 

35 the amount of GSH available. In particular, this occurs 



during bread making. Thus, it is desirable to be able to 
monitor the level of GSTs having glutathione peroxidase 
activity in batches of seed and flour. 



This can be done by any suitable means. For example, 
antibodies of the invention can be used to detect 
polypeptides or dimeric proteins of the invention, as 
described above. Similarly, probes of the invention can 
be used to detect GST mRNA, as described above. 

Alternatively, to determine directly the level of GSH in 
a batch, the invention provides a method of determining 
the GSH level in a batch of seed or flour comprising: 

(a) contacting a sample from the batch with a 
polypeptide or dimeric protein of the invention and a 
compound whose conjugation to GSH is catalysed by the 
polypeptide or protein; and 

(b) determining the GSH level from the extent of 
reaction between the compound and GSH, 

Controlling the growth of weeds 



The invention also provides a method of controlling 
the growth of weeds at a locus where a transgenic plant 
of the invention is being cultivated, which method 
comprises applying a herbicide to the locus. Any amount 
of herbicide may be used, as long as it is herbicidally 
effective against the weeds but leaves the herbicide 
resistant plants of the invention unaffected, or 
substantially unaffected. The effect on the weeds may be, 
for example, to kill them or to inhibit their growth. 

Any type of weed that responds to a particular 
herbicide may be controlled in this way, Alopecurus 
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10 



myosuroides f Avena fatua^ Lolium spp., Bromus spp, r Poa 
annua^ Galium aparinsr Aper spica-venti ^ Matricaria 
inodora. Stellar ia media, Papaver rholas. Polygonum spp . , 
Setaria sp.. Sorghum halapense, Panicum miliaceum, 
Echinochloa spp., Digitaria sanguinalis, Phalaris minor. 
Abut Hon theophrasti , Amaranthus retro flexus , chenopodium 
album. Datura stramoniuon , Solanum nigrum, Xanthium 
strumarium, saggitaria spp . , Monochoria vaginalis , 
Lindernia spp,, Eleokaris kurogaai, Scirpus juncoides, 
Cyperus spp. 



15 



The herbicide may, for example, be one whose 
activity is identified by the methods of the invention 
(see above) . Alternatively, it may be a known herbicide, 
for example one of the herbicides mentioned herein. 



The herbicide may be applied at any suitable time 
during the life cycle of the transgenic plant, for 
example pre-emergence or post-emergence. Timing of 

20 application will be tailored to the development of the 

weeds which it is desired to control. Where inducible or 
tissue- and/or stage- specific expression of the active 
dimer of the invention is employed, timing of herbicide 
application will be tailored to the optimal expression of 

25 the invention in the course of the development of the 
transgenic plant of the invention. 

The following Examples illustrate the invention. 



30 



EXAMPLES 



Example 1: Isolation and characterisation of tlie 
nucleotide secjuence encoding TaGSTl 



35 



(a ) Purification of wheat GST isoenzymes 
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Wheat GST isoenzymes containing the TaGSTl subunit 
were purified by the method of Dixon et al (Pestic. Sci . 
1997, 50, 72-82) . This is summarised below. 

5 Wheat seeds (Triticum aestivum L. var. Hunter) were 

imbibed in a 10 mg/1 solution of the herbicide safener 
f enchlorazole-ethyl and then grown in an environmental 
growth room with further root -applied watering treatments 
of 5 mg/1 f enchlorazole ethyl applied as required. At 10 

10 days after imbibing, the shoot tissue was harvested and 
extracted prior to precipitation of the protein with 
ammonium sulphate (80% saturation). The total protein 
extract was then applied in the presence of 1 M ammonium 
sulphate to a phenyl-Sepharose column. The bound GSTs 

15 were then recovered, firstly by reducing the ammonium 
sulphate concentration to 0 M to yield the polar GST 
fraction, which represented 61% of the recovered activity 
toward l-chloro-2 , 4-dinitrobenzene (CDNB) . The remaining 
39% of the GST activity was then recovered by adding 

20 ethylene glycol (50 % v/v) to the running buffer to yield 
the hydrophobic GST fraction. 

The polar and hydrophobic GST fractions were then 
independently applied to the affinity matrix, S-hexyl- 
glutathione agarose. This matrix bound 90% of the GST 

25 activity toward CDNB, Prior to elution of the column with 
the ligand, S-hexyl -glutathione, the matrix was washed 
with phosphate buffer, followed by phosphate buffer 
containing 200 mM potassium chloride. The GSTs eluting in 
this salt wash were termed the "loosely-bound" fraction. 

30 Tightly-bound proteins were then eluted with 5 mM S- 

hexyl-glutathione . With both the polar and hydrophobic 
GSTs an average of 34% of the GST activity toward CDNB 
eluted in the loosely-bound fraction and 66% eluted in 
the presence of 5-hexyl-glutathione . The loosely-bound 

35 fraction contained the GSTs which will be considered in 
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Example 2. The major wheat GSTs of interest in this 
example were found in the affinity-purified pool and to 
define the numbers of isoenzymes and component subunits 
present, this pool was analysed in detail. 

5 

When the affinity-bound pools of the polar and 
hydrophobic GSTs were analysed by anion-exchange 
chromatography on Q-sepharose, the partial resolution of 
the eluting activity suggested the presence of multiple 

10 isoenzymes (Figure 1). The component polypeptides in the 
active fractions were then analysed by silver staining 
after resolution by sodium dodecyl sulphate 
polyacrylamide gel electrophoresis (SDS-PAGE) . It was 
concluded that in f enchlorazole-ethyl-treated wheat the 

15 polar GSTs were composed of 25 kDa and 26 kDa 

polypeptides, while the hydrophobic fraction contained 
25kDa, 26 kDa and 24 kDa polypeptides. Further analysis 
by reversed-phase HPLC confirmed the subunit compositions 
(Figure 2). Based on the combined analyses by Q- 

20 sepharose, HPLC and SDS-PAGE these GST polypeptides were 
named as described in Table 1, which also contains 
details of how these subunits associate together to form 
the active dimers found in plants and the relative 
abundance of these subunits in extracts from unsafened 

25 and f enchlorazole-ethyl treated (safened) plants. 

(b) GST activities of the purified TaGST isoenzymes 

The purified isoenzymes were assayed for GST 
30 activity toward herbicides using the HPLC-based assays 
described by Edwards R. and Cole D.J. (Pesticide 
Biochemistry and Physiology Vol. 54, pp96-104 (1996)) and 
the results are presented in Table 2. Both polar and 
hydrophobic GSTs from the affinity-bound pools of 
35 isoenzymes showed detoxifying activities toward the 
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selective graminicide f enoxaprop-ethyl , the diphenyl- 
ether herbicide fluorodifen, and the chloracetanilide 
metolachlor . These isoenzymes had additional activities 
as glutathione peroxidases able to reduce linoleic acid 
5 hydroperoxide, a major reaction product formed during 

membrane peroxidation in plants (Williamson and Beverly. 
J. Cereal Sci . 8., 1988, 155-163). 

(c) Preparation of polyclonal antibodies to the major 
10 wheat GST isoenzymes 

Purified TaGSTl-l was used to immunise rabbits to 
raise polyclonal antibodies to the differing isoenzymes. 
The reactivity of the antiserum toward crude wheat 
15 preparations was demonstrated with a Western blot of 

polypeptides resolved by SDS-PAGE. The antibodies were 
then used to identify the corresponding cDNAs in an 
expression library . 

20 (d) Identification and characterisation of a cDNA 

encoding TaGSTl 

An expression library was prepared from poly (A) + 
RNA extracted from 7-day wheat shoots grown from seed 

25 treated with f enchlorazole-ethyl . The library was 

constructed in lambda ZAP II (Stratagene) and plaque 
forming units (pfus) screened with the antiserum raised 
against TaGSTl-l. From an initial screen of 170,000 pfus 
17 positive plaques were identified, of which 12 were 

30 further purified to homogeneity in secondary and tertiary 
screens and the wheat cDNAs excised from the phage to 
form Bluescript plasmids in E. coli SOLR. (Stratagene). 
Automated DNA sequencing showed that all clones had an 
identical coding sequence, although differences in the 5' 

35 and 3 ' untranslated regions were apparent , such that of 6 
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clones sequenced fully on both strands, 4 different 
untranslated regions were observed . Since these clones 
shared a common open reading frame they were all 
— d e-s-i g n a-ted — TaGS~T^ — an d- -t hen— s ubd i-v-i-d e d as A, C and D- 

The nucleotide sequence of TaGSTl showing the variable 
untranslated regions of A, B, C and D is shown in SEQ ID 
No. 1, together with the deduced amino acid sequence of 
the coding region (SEQ ID No. 2) . 

To confirm that TaGSTl encoded a GST, it was 
expressed as a fusion protein with beta-galactosidase 
using the pBluescript plasmid in £7. coJi strain SOLR . 
TaGSTl clones were inoculated into LB liquid medium and 
were grown overnight at 37°C on an orbital shaker in the 
presence of IPTG . Bacteria were then pelleted by 
centrif ugation, lysed by sonication and assayed for GST 
activity toward CDNB and analysed by SDS-PAGE and Western 
blotting using the anti- TaGSTl-1 serum. With all six 
TaGSTl clones , GST activity could be determined in the 
crude extracts in the range 30 - 50 nkat/ mg crude 
lysate. This was in contrast to control £. coll 
containing the blue script plasmid without a cDNA insert 
which showed negligible GST activity (0.2 nkat/mg) . When 
the polypeptides contaLned in the ly sates of the various 
TaGSTl clones were analysed by SDS-PAGE, in every case 
the TaGSTl -fusion protein was clearly visible as a highly 
expressed polypeptide relative to the controls. The 
differences in size of the fusion proteins resulted from 
variations in the length of 5 ' untranslated sequence from 
the TaGSTl included in the fusion. All the fusion 
proteins reacted with the anti-TaGSTl serum. 

To confirm that the GST activity in the extracts 
from TaGSTl clones was due to the fusion protein, the 



GST-fusion was purified from the clone raGSTl.2 using 
hexyl-glutathione agarose affinity chromatography. The 
pure fusion protein was then assayed for enzyme activi 
toward herbicide and hydroperoxide substrates and was 
found to show a similar spectrum of activities to that 
the pure TaGSTl-l isoenzyme from wheat shoots. 
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Table 1 

Siunmary of the characteristics of major classes of wheat 
GST isoenzymes . 

5 

The GST subunits had the following retention times 
by reversed-phase HPLC. TaGSTla - 26.4 min, TaGSTlb - 

27.1 min, raGST2 - 31.1 min, TaGSTS - 30.9 min, raGST4 - 

33.2 min. 

10 



15 



20 



25 



Isoenzyme 

TYPE 


Subunits 


Polar (P) 
OR Hydrophobic 
(H) 


Molecular 
WEIGHT (kDa) 


Anti- 
TaGSTI 
antibody 

REACTION 


% 

Enhancement 

BY SAFENER 


TaGSTl-l 


TaGSTla 


P 


25 


+ 


30-50 




TaGSTlb 


P 


25 


+ 


30-50 


























raGSTl-2 


TaGSTla 


P 


25 


+ 


Only 
observed 
with 
saf ener 




TaGSTlb 


P 


25 


+ 




raGST2 


P 


26 


























TaGSTl-3 


TaGSTla 


P 


25 


+ 


Only 

observed 
with 
saf ener 




TaGSTlb 


P 


25 


+ 




TaGSTB 


H 


26 


























raGSTl-4 


TaGSTla 


P 


25 




300% 




TaGSTlb 


P 


25 


+ 


300% 




TaGST4 


H 


24 




300% 



30 
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Table 2 



Activity of GST isoenzymes purified from f enchlorazole- 
ethyl-treated wheat shoots . 

5 Enzyme activities are expressed as nkat.mg-1 



10 



15 



ISOENZYME 


CDNB 


Fluorodi fen 


Fenoxaprop- 

ETHYL 


Metolachlor 












Polar 




















TaGSTl-1 


1, 528 


0. 97 


0 


0. 11 


raGSTl-2 


1, 441 


0.38 


0. 61 


0.25 












Hydrophobic 




















raGSTl-3 


1, 700 


0.38 


0.44 


0,28 


raGSTl-4 


1, 553 


0. 57 


0.23 


0.23 
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Table 3 

CDNB and herbicide ac1:ivit:ies of recombinant: wheat: GSTs 



Activities expressed as nkat.mg-1 ± standard error 



Recombinant enzyme 


CDNB 


Fluorodifen 


FENOXAPROP- ETHYL 


Metolachlor 












TaGSTl 


1970 
± 30 


2.0 
± 0.1 


0 


0. 127 
± 0.014 












WIG 1 


406.5 
± 0.5 


0. 136 
± 0.011 


0.050 
± 0.010 


0.315 
± 0.003 












WIC 2 


187 
± 1 


0. 096 
± 0.002 


0.085 
± 0,002 


0.512 
± 0.04 












WIG 3 


2, 519 
± 88 


0.014 
± 0.006 


0.093 
± 0.002 


0.053 
± 0.004 












WIC 4 


980 
± 86 


0.036 
± 0.004 


0.012 
± 0.001 


0.037 
± 0.003 












WIC 5 


174 
± 8 


0.030 
± 0.002 


0.067 
± 0.003 


0.040 
± 0.004 












TA 27 


237 
± 13 


0.034 
± 0.003 


0.036 
± 0.004 


0.063 
± 0.006 
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EXAMPIiE 2 : Cloning of wheat GSTs resembling the type 1 
GSTs from maize. 

(a) Characterisation of type 1 GSTs in wheat 

The observation that extracts from safener- treated 
wheat shoots contained GSTs which, unlike those described 
in Example 1, were not selectively retained on the 
affinity matrix suggested that a discrete class of GSTs 
were present in this loosely bound fraction. Crude 
extracts of wheat seedlings were analysed by Western 
blotting following SDS-PAGE using a polyclonal rabbit 
antiserum raised to the ZmGSTl~ll heterodimer- The 
antiserum reacted strongly with several polypeptides of 
Mr 23 - 27 kDa . These polypeptides were present in the 
loosely-bound fraction from the S-hexyl-glutathione 
affinity column, but not in the affinity bound fraction* 

(b) Cloning of cDNAs from a wheat expression library 

Having established that saf ener-treated wheat 
shoots contained polypeptides which cross-reacted with 
the antiserum raised to the maize GSTs, the primary cDNA 
expression library prepared from f enchlorazole-ethyl 
treated wheat shoots was screened with the antibody for 
positive clones. Following a screen of 170,000 pfu., ten 
positive plaques were identified, with obvious 
differences in the intensity of recognition, with four 
plaques showing a strong colour reaction and six plaques 
of lower intensity. These cDNA clones were termed WIC 
clones. All four of the stronger-reacting plaques ( P^JC 1, 
2, 4 and 5) and four of the weaker positives (WIC 3, 7, 8 
and 10) were purified to homogeneity, the respective 
.plasmids excised and DNA preparations sequenced. The 
clones were then grouped by their degree of similarity in 
sequence . 



-63- 

In the sequence listing, WIC 1 is SEQ ID No. 3 and 
its deduced amino acid sequence is SEQ ID No. 4. WIC 2 is 
SEQ ID No. 5 and its deduced amino acid sequence is SEQ 
ID No. 6. The coding sequences of WIC 3, WIC 7 and WIC 8 
were identical in sequence. The DNA sequence of WIC 3/7/8 
is given in SEQ ID No. 7 and the deduced amino acid 
sequence in SEQ ID No. 8 All three sequences contained a 
stop codon in the 5' untranslated region of the GST gene, 
although some expression occurred- The DNA sequence of 
WIC 5 is shown in SEQ ID No. 9, and the deduced amino 
acid sequence in SEQ ID No. 10. WIC 4 and WIC 10 had 
identical coding sequences, but differed in their 
untranslated regions. In particular, WIC 10 had a stop 
codon in the 5* untranslated region, though this did not 
prevent all expression of the fusion protein The WIC 4 
DNA sequence is given in SEQ ID No. 11 and the deduced 
WIC 4/10 amino acid sequence in SEQ ID No. 12 (the WIC 10 
DNA sequence is not shown) . 

(c) Cloning of wheat GSTs bv differential screening of a 
cDNA library 

A further cDNA clone, termed TA 27 was obtained. A 
cDNA library prepared from wheat seedlings treated with 
the herbicide safener cloquintocet-mexyl , was screened 
for clones which represented mRNAs which were 
differentially expressed in wheat in response to safener 
application. The identity of the clone as a GST was 
suggested from its nucleotide (SEQ ID No. 13) and deduced 
amino acid (SEQ ID No. 14) sequence. As the coding 
sequence of TA 27 was not in frame with beta- 
galactosidase in the pBluescript vector, the coding 
sequence was sub-cloned into the expression vector pET 
11a (Novagen) , with translation starting at the first ATG 
codon in the clone, which gave a reasonable alignment of 
the open reading frame with that of other GSTs involved 



in herbicide metabolism, notably the ZmGSTlV sequence. 



(d) Activity of recombinant GSTs of the invention 

To confirm that the NIC clones and TA 27 encoded 
functional GSTs the corresponding enzymes were expressed 
as recombinant enzymes in E. coli. The full coding 
sequence of TA 27 was expressed in the pET vector, while 
the WIC clones were expressed as fusions with part of the 
beta-galactosidase enzyme using the pBluescript vector. 
The levels of recombinant protein expressed varied 
between the differing clones. Appreciable amounts of 
recombinant protein were observed in the TA 27 pET clones 
and in clones VJIC i, ¥11C 2, WIC 4 and WIC 5. Western 
blotting of these total bacterial extracts with the 
antiserum raised to Z/nGSTI-II showed that the fusion 
proteins were selectively recognised by the antiserum. In 
contrast, use of the antiserum demonstrated much lower 
levels of expression of immunoreacti ve fusion proteins in 
clones WIC 3, WIC 7, WIC 8 and WIC 10. 



To assay the recombinant fusion proteins for GST 
activity , the E. coli were grown in the presence of IPTG 
and then pelleted by centrif ugation , The bacteria were 
then lysed by sonication and the protein precipitated 
using 80% ammonium sulphate. After resuspension and 
desalting, GSTs were purified by affinity chromatography. 
The WIC 3 fusion protein was purified using 
sulphobromopht ha lein-S- glutathione affinity 
chromatography (Mozer at ai. Biochem. 22, 1983, 1068- 
1072) while the other WIC fusion proteins were purified 
using glutathione-agarose (Mannervik and Guthenberg. 
Methods Enzymol. 77, 1981, 231-235) .The purified enzymes 
were then assayed for GST activities toward herbicides 
(Table 3) and GST activities toward non-herbicide 
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substrates and glutathione peroxidase activities toward 
organic hydroperoxides (Table 4). 



Table 4 



Other GST activities and glutathione peroxidase 
activities . 

Activities expressed as nkat.mg-l ± standard error. 
Peroxidase activities expressed as absorbance change at 
340 nm.mg-1 ± standard error (n=3). N.D not detected. 





CUMENE 

Hydroperoxide 


Benzyl 

isothiocyanate 


Crotonaldehyde 


ethacrynic 

ACID 












WIC 1 


18.6 ± 0.5 


18.0 ± 3.75 


7.1 ± 0.7 


N.D. 












WIC 2 


28.2 ± 1.7 


33.3 ± 4.5 


5.5 ± 1.3 


N.D. 












WIC 3 


1.4 ± 0.3 


9.0 ± 0.5 


6.3 ± 0.9 


N.D. 












WIC 4 


6.2 ± 0.3 


4.2 ± 0.4 


5.5 ± 0.6 


1.4 ± 0.3 












WIC 5 


1.3 ± 0.2 


9.4 ± 2.0 


4.5 ± 0.5 


N.D. 












IC GST 25 


0.7 ± 0.1 


11.8 ± 0 


3.7 ± 0.3 


N.D. 












TA 21 


peroxidase activity as WIC 1 and WIC 2 



I 



-67- 

EXAMPLE 3: A microti tre plate - based screen to identify 
herbicidal molecules which are metabolised by GSTs of the 
invention and may selectively control weeds in a crop of 
wheat or other species such as maize, soybean or rice 

5 

(a) Degradation of candidate herbicides by wheat GSTs and 
relationship to crop and weed selectivity 

10 Herbicidal molecules which are degraded by 

recombinant wheat GSTs may be predicted to be tolerated 
by plants of wheat or other crop species . These 
herbicides may be less rapidly degraded in weeds such as 
black-grass {Alopecurus myosuroides) that it is desirable 

15 to control in a crop of wheat or other species. 

Herbicides found in a laboratory based screen to be 
metabolised by these GSTs are therefore likely to possess 
useful abilities to selectively control troublesome weeds 
in a crop of wheat , or other species such as mai ze , 

20 soybean, rice, cotton, barley, oat, rye, sorhum, 
triticale, potato, sugarcane or sugarbeet. 

(b) A 96 well plate - based assay procedure for 

2 5 identifying novel herbicides degraded by recombinant 
wheat GSTs 

Novel herbicides arising from a chemical synthesis 
programme oriented to optimisation for selective 

30 herbicidal activity and potency may be screened for 

ability to be degraded by a panel of recombinant GSTs 
using a 96 well microplate assay format and subsequent 
reaction analysis by automated High Pressure Liquid 
Chromatography (HPLC) . This allows for example, the 

35 screening of a set of eleven novel herbicides and one 

positive control compound such as CDNB, against a panel 
of seven recombinant GSTs. An eighth file of wells 



contains test compounds but lacks GSTs; these wells serve 
to identify non-enzymic reaction of the test compounds 
with reduced glutathione. Alternatively, the array can be 
configured to screen more test compounds against a more 
limited number of GSTs. For example, fifteen compounds 
can be screened against five GSTs or forty seven 
compounds may be screened with a single mixture of GSTs. 
In all cases, provision is made for a positive control 
and to test for non-enzymic reaction with reduced 
glutathione . 



Enzyme assays are carried out in a total reaction 
volume of 100 microlitres. Each reaction mixture contains 
100 micromolar Tris.HCl buffer, pH 7.8, 500 micromolar 
reduced glutathione and where appropriate, 500 micromolar 
test compound or a reference substrate such as CDNB; and 
14 micrograms of GST protein. The microplate is incubated 
at 30°C on a variable speed agitator for 30 minutes and 
reactions are then stopped by the addition of 15 
microlitres of 23% perchloric acid solution. The 
microplate is then centrifuged at 2000 g for 15 minutes. 



(c) Reaction analysis bv automated High Pressure Liquid 
Chromatography . 



The separation and analysis of glutathione 
conjugates of test herbicides may be carried out using 
High Pressure Liquid Chromatography (HPLC) , for example a 
Gilson HPLC in tandem with corresponding software, for 
example Gilson Version 7 • 12 and fitted with an 
appropriate column, for example a 5 cm Spherisorb 0DS2 
column. Typically, separation may be carried out using a 
two phase solvent system as follows: Phase A: water 
containing 0.1% trif luoroacetic acid and 5.5% 
acetonitrile ; Phase B: 100% acetonitrile ; flow rate 1.5 
ml per minute; injection volume 20 microlitres. 
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The elution gradient may be typically as follows: 
10% phase B for one minute, followed by a linear gradient 
to reach 60% phase B after 8.5 minutes. The gradient is 
further increased to reach 100% phase B at 9 minutes; 
phase B is continued at 100% until 11.5 minutes and is 
then reduced in a linear gradient to 10% at 13.5 minutes. 
A further 1 . 5 minutes at 10% phase B is required to re- 
equilibrate the column. Absorbance signals are detected 
at 2 64 nanometres using a suitable UV detector, and 
detect the glutathione con j ugate of CDNB, having a 
retention time of 2.4 minutes, resolving this from 
unreacted CDNB having a retention time of 4.75 minutes. 
Such conditions also allow for the resolution and 
detection of the glutathione conjugates arising from the 
metabolism of other reference herbicides such as 
metolachlor, fenoxaprop, f enoxaprop-ethyl and fluorodifen 
and also of a variety of novel herbicidal compounds 
identified in the assay. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: RHONE-POULENC AGRICULTURE LIMITED 

(B) STREET: FYFIELD ROAD 

(C) CITY: ONGAR 

(D) STATE: ESSEX 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP): CMS OHW 

(i1) TITLE OF INVENTION: GLUTATHIONE TRANSFERASES 
(iii) NUMBER OF SEQUENCES: 14 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1085 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 46. .711 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION :1. .1085 

(D) OTHER INFORMATION:/note= "SEQUENCE OF TaGSTl AND 
ENCODED AMINO ACID SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CAAACACAAG CACAGATCGG TCGAGATTCA AGGCAACC6G GAGCA ATG GCG GGC 54 

Met Ala Gly 



GAG AAG GGG CTG GTG CTG CTG GAC TTC TGG GTG AGC CCG TTC GGG CAG 102 
Glu Lys Gly Leu Val Leu Leu Asp Phe Trp Val Ser Pro Phe Gly Gin 



* 



-71- 



CGC 


GTG 


CGC 


ATC 


GCG 


CTG 


GCC 


GAG 


AAG 


GGC 


CTG 


CCC 


TAC 


GAG 


TAC 


GCG 


150 


Arg 


Val 


Arg 


lie 


Ala 


Leu 


Ala 


Glu 


Lys 


Gly 


Leu 


Pro 


Tyr 


Glu 


Tyr 


Ala 




GAG 


GAG 


GAC 


CTG 


ATG 


GCC 


GGC 


AAG 


AGC 


GAC 


CGC 


CTC 


CTC 


CGC 


GCC 


AAC 


198 


Glu 


Glu 


Asp 


Leu 


Met 


Ala 


Gly 


Lys 


Ser 


Asp 


Arg 


Leu 


Leu 


Arg 


Ala 


Asn 




CCG 


GTG 


CAT 


AAG 


AAG 


ATC 


CCG 


GTG 


CTC 


CTC 


CAC 


GAC 


GGC 


CGT 


GCC 


GTC 


246 


Pro 


Val 


His 


Lys 


Lys 


He 


Pro 


Val 


Leu 


Leu 


His 


Asp 


Gly 


Arg 


Ala 


Val 




AAC 


GAG 


TCC 


CTC 


ATC 


ATC 


CTC 


CAG 


TAC 


CTG 


GAG 


GAG 


GCC 


TTC 


CCG 


GAC 


294 


Asn 


Glu 


Ser 


Leu 


lie 


He 


Leu 


Gin 


Tyr 


Leu 


Glu 


Glu 


Ala 


Phe 


Pro 


Asp 




GCG 


CCC 


GCT 


CTG 


CTC 


CCC 


TCC 


GAC 


CCC 


TAC 


GCG 


CGC 


GCG 


CAG 


GCC 


CGC 


342 


Ala 


Pro 


Ala 


Leu 


Leu 


Pro 


Ser 


Asp 


Pro 


Tyr 


Ala 


Arg 


Ala 


Gin 


Ala 


Arg 




TTC 


TGG 


GCC 


GAC 


TAC 


GTC 


GAC 


AAG 


AAG 


GTC 


TAC 


GAC 


TGC 


GGC 


TCC 


CGC 


390 


Phe 


Trp 


Ala 


Asp 


Tyr 


Val 


Asp 


Lys 


Lys 


Val 


Tyr 


Asp 


Cys 


Gly 


Ser 


Arg 




CTC 


TGG 


AAG 


CTC 


AAG 


GGC 


GAG 


CCG 


CAG 


GCG 


CAG 


GCG 


CGC 


GCC 


GAG 


ATG 


438 


Leu 


Trp 


Lys 


Leu 


Lys 


Gly 


Glu 


Pro 


Gin 


Ala 


Gin 


Ala 


Arg 


Ala 


Glu 


Met 




CTG 


GAC 


ATC 


CTC 


AAG 


ACC 


CTC 


GAC 


GGC 


GCG 


CTC 


GGG 


GAC 


AAG 


CCC 


TTC 


486 


Leu 


Asp 


He 


Leu 


Lys 


Thr 


Leu 


Asp 


Gly 


Ala 


Leu 


Gly 


Asp 


Lys 


Pro 


Phe 




TTC 


GGC 


GGC 


GAC 


AAG 


nc 


GGG 


TTC 


GTC 


GAC 


GCC 


GCC 


TTC 


GCG 


CCC 


TTC 


534 


Phe 


Gly 


Gly 


Asp 


Lys 


Phe 


Gly 


Phe 


Val 


Asp 


Ala 


Ala 


Phe 


Ala 


Pro 


Phe. 




ACC 


GCG 


TGG 


TTC 


CAC 


AGC 


TAC 


GAG 


AGG 


TAC 


GGC 


GAG 


TTC 


AGC 


CTG 


CCG 


582 


Thr 


Ala 


Trp 


Phe 


His 


Ser 


Tyr 


Glu 


Arg 


Tyr 


Gly 


Glu 


Phe 


Ser 


Leu 


Pro 




GAG 


GTG 


GCG 


CCC 


AAG 


ATC 


GCC 


GCG 


TGG 


GCC 


AAG 


CGC 


TGC 


GGC 


GAG 


CGG 


630 


Glu 


Val 


Ala 


Pro 


Lys 


He 


Ala 


Ala 


Trp 


Ala 


Lys 


Arg 


Cys 


Gly 


Glu 


Arg 




GAG 


AGC 


GTC 


GCC 


AAG 


AGC 


CTC 


TAC 


TCG 


CCG 


GAC 


AAG 


GTG 


TAC 


GAC 


TTC 


678 


Glu 


Ser 


Val 


Ala 


Lys 


Ser 


Leu 


Tyr 


Ser 


Pro 


Asp 


Lys 


Val 


Tyr 


Asp 


Phe 




ATC 


GGC 


CTG 


CTC 


AAG 


AAG 


AAG 


TAC 


GGC 


ATC 


GAG 


TA GGCGCGCCGA 




723 


lie 


Gly 


Leu 


Leu 


Lys 


Lys 


Lys 


Tyr 


Gly 


He 


Glu 















CGGACGGACG GACGGGCCAT GCAGGCGACA GCCGGCCCGC CGTCCGGAGG GAAGCAACAA 783 

ATAAATCAGG GAGCGATTTG GGTGGCCTAC AATGCGTACG TCTGGATAGA GTATTTCTTT 843 

CTTTCTTTCT TCGTGGAATA AAGTGCTCCG TGTGTGTGTG GTTGGTGGTT GTTGGTTGGA 903 

TCAGTCAGTG TGTGTGGGTG CGTGHGTGT ACTCAGTACT CGTGATGTGT GTGTGTGTCA 963 

ATGTGTCAAC CCTGGTCTTC GGTGGGGGCA GCACCGAGTT GCCACCTGCC ATTCCATTTC 1023 

CATTCCGGGC GATGAATAAA TTAAAAAAGA GTCTCATTTG TTTAAAAAAA AAAAAAAAAA 1083 

AA 1085 



(2) INFORMATION FOR SEQ ID NO: 2: 



J 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Gly Glu Lys Gly Leu Val Leu Leu Asp Phe Trp Val Ser Pro 
15 10 15 

Phe Gly Gin Arg Val Arg He Ala Leu Ala Glu Lys Gly Leu Pro Tyr 
20 25 30 

Glu Tyr Ala Glu Glu Asp Leu Met Ala Gly Lys Ser Asp Arg Leu Leu 
35 40 45 

Arg Ala Asn Pro Val His Lys Lys He Pro Val Leu Leu His Asp Gly 
50 55 60 

Arg Ala Val Asn Glu Ser Leu He He Leu Gin Tyr Leu Glu Glu Ala 
65 70 75 80 

Phe Pro Asp Ala Pro Ala Leu Leu Pro Ser Asp Pro Tyr Ala Arg Ala 
85 90 95 

Gin Ala Arg Phe Trp Ala Asp Tyr Val Asp Lys Lys Val Tyr Asp Cys 
100 105 110 

Gly Ser Arg Leu Trp Lys Leu Lys Gly Glu Pro Gin Ala Gin Ala Arg 
115 120 125 

Ala Glu Met Leu Asp lie Leu Lys Thr Leu Asp Gly Ala Leu Gly Asp 
130 135 140 

Lys Pro Phe Phe Gly Gly Asp Lys Phe Gly Phe Val Asp Ala Ala Phe 
145 150 155 160 

Ala Pro Phe Thr Ala Trp Phe His Ser Tyr Glu Arg Tyr Gly Glu Phe 
165 170 175 

Ser Leu Pro Glu Val Ala Pro Lys He Ala Ala Trp Ala Lys Arg Cys 
180 185 190 

Gly Glu Arg Glu Ser Val Ala Lys Ser Leu Tyr Ser Pro Asp Lys Val 
195 200 205 

Tyr Asp Phe He Gly Leu Leu Lys Lys Lys Tyr Gly He Glu 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 865 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 54. .725 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION:!. .865 

(D) OTHER INFORMATION: /note= "WICl SEQUENCE AND ENCODED 
ICl AMINO ACID SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGAACTCAAC CATTGATCTT CAAGAAGCGG AAGCAAACAG AGCAAAAGGT GTG ATG 56 

Met 



GCG GCG CCG GCG GTG AAG GTG TAC GGG TGG GCG ATG TCG CCG TTC GTG- 104 
Ala Ala Pro Ala Val Lys Val lyr Gly Trp Ala Met Ser Pro Phe Val 

GCG CGC GCG CTG CTG TGC CTG GAG GAG GCC GGC GTG GAG TAC GAG CTC 152 
Ala Arg Ala Leu Leu Cys Leu Glu Glu Ala Gly Val Glu Tyr Glu Leu 

GTC CCC ATG AGC CGC GAG GCC GGC GAC CAC CGC CAG CCC GAC TIC CTC 200 
Val Pro Met Ser Arg Glu Ala Gly Asp His Arg Gin Pro Asp Phe Leu 

GCC CGG AAC CCC TTC GGC CAG GTC CCC GTT CTC GAG GAC GGC GAC CTC 248 
Ala Arg Asn Pro Phe Gly Gin Val Pro Val Leu Glu Asp Gly Asp Leu 

ACC ATC nC GAG TCG CGC GCC GTC GCG AGG CAC GTG CTG CGC AAG CAC 296 
Thr He Phe Glu Ser Arg Ala Val Ala Arg His Val Leu Arg Lys His 

AAA CCG GAG CTG CTG GGC TCC GGC TCG CCG GAG TCG GCG GCG ATG GTG 344 
Lys Pro Glu Leu Leu Gly Ser Gly Ser Pro Glu Ser Ala Ala Met Val 

GAC GTG TGG CTG GAG GTG GAG GCC CAC CAG CAC CAG ACC CCG GCG GGC 392 
Asp Val Trp Leu Glu Val Glu Ala His Gin His Gin Thr Pro Ala Gly 

ACC ATC GTC ATG CAG TGC ATC CTC ACC CCG TTC CTC GGC TGC CAG CGC 440 
Thr He Val Met Gin Cys He Leu Thr Pro Phe Leu Gly Cys Gin Arg 

GAC CAG GCC GCC ATC GAC GAG AAC GCG GCA AAG CTG ACG AAT CTG TTC 488 
Asp Gin Ala Ala He Asp Glu Asn Ala Ala Lys Leu Thr Asn Leu Phe 

GAC GTG TAC GAG GCG CGC CTG TCG GCG TCG AGG TAC CTT GCC GGG GAG 536 
Asp Val Tyr Glu Ala Arg Leu Ser Ala Ser Arg Tyr Leu Ala Gly Glu 

GCG GTC AGC CTC GCG GAC CTC AGC CAC TTC CCG TTC ATG CGA TAC TTC 584 
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Ala Val Ser Leu Ala Asp Leu Ser His Phe Pro Phe Met Arg Tyr Phe 

ATG GAC ACC GAG TAC GCG TCG CTG GTG GAG GAG CGC CCG CAC GTG AAG 632 
Met Asp Thr Glu Tyr Ala Ser Leu Val Glu Glu Arg Pro His Val Lys 

GCG TGG TGG GAG GAG TTC AAG GCC AGC CCG GCG GCG AAG AGG GTG ACG 680 
Ala Trp Trp Glu Glu Phe Lys Ala Ser Pro Ala Ala Lys Arg Val Thr 

GAG TTC ATG CCG CCA AAC TTC GGG TTC GGA AAG AAG GCA GAG AAG 725 
Glu Phe Met Pro Pro Asn Phe Gly Phe Gly Lys Lys Ala Glu Lys 

TGATGACAAG AACGAACACC GAGCGAACAT GTTGTGTGGT CTGTGCGACC CGACCATGGC 785 

TCAATGTTTT GGGCTGTTTG TGTTTCACGC ATGAAT6AAT AAAACAAAAT GCTTTTGGGT 845 

TTCAAAAAAA AAAAAAAAAA 865 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Ala Pro Ala Val Lys Val Tyr Gly Trp Ala Met Ser Pro Phe 
15 10 15 

Val Ala Arg Ala Leu Leu Cys Leu Glu Glu Ala Gly Val Glu Tyr Glu 
20 25 30 

Leu Val Pro Met Ser Arg Glu Ala Gly Asp His Arg Gin Pro Asp Phe 
35 40 45 

Leu Ala Arg Asn Pro Phe Gly Gin Val Pro Val Leu Glu Asp Gly Asp 
50 55 60 

Leu Thr He Phe Glu Ser Arg Ala Val Ala Arg His Val Leu Arg Lys 
65 70 75 80 

His Lys Pro Glu Leu Leu Gly Ser Gly Ser Pro Glu Ser Ala Ala Met 
85 90 95 

Val Asp Val Trp Leu Glu Val Glu Ala His Gin His Gin Thr Pro Ala 
100 105 110 

Gly Thr He Val Met Gin Cys He Leu Thr Pro Phe Leu Gly Cys Gin 
115 120 125 

Arg Asp Gin Ala Ala He Asp Glu Asn Ala Ala Lys Leu Thr Asn Leu 
130 135 140 
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Phe Asp Val Tyr Glu Ala Arg Leu Ser Ala Ser Arg Tyr Leu Ala Gly 
145 150 155 160 

Glu Ala Val Ser Leu Ala Asp Leu Ser His Phe Pro Phe Met Arg Tyr 
165 170 175 

Phe Met Asp Thr Glu Tyr Ala Ser Leu Val Glu Glu Arg Pro His Val 
180 185 190 

Lys Ala Trp Trp Glu Glu Phe Lys Ala Ser Pro Ala Ala Lys Arg Val 
195 200 205 

Thr Glu Phe Met Pro Pro Asn Phe Gly Phe Gly Lys Lys Ala Glu Lys 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 60.. 725 

(ix) FEATURE: 

(A) NAME/KEY: niisc_feature 

(B) LOCATION :1. .930 

(D) OTHER INFORMATION: /note= "WIC2 SEQUENCE AND ENCODED 
IC2 AMINO ACID SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CACGCGTCCA TCTCCAAGAA GCGGAAGCTA GTGGAGCAGA GCAAACCAAG CAAGGTTGG 59 



ATG 


GCG 


CCG 


GCG 


GTG 


AAG 


GTG 


TAC 


GGG 


TGG 


GCC 


GTG 


TCG 


CCG 


TTC 


GTG 


107 


Met 


Ala 


Pro 


Ala 


Val 


Lys 


Val 


Tyr 


Gly 


Trp 


Ala 


Val 


Ser 


Pro 


Phe 


Val 




GCG 


CGC 


CCA 


CTG 


CTG 


TGC 


CTG 


GAG 


GAG 


GCC 


GGC 


GTC 


GAG 


TAC 


GAG 


CTC 


155 


Ala 


Arg 


Pro 


Leu 


Leu 


Cys 


Leu 


Glu 


Glu 


Ala 


Gly 


Val 


Glu 


Tyr 


Glu 


Leu 




GTG 


TCC 


ATG 


AGC 


CGC 


GCG 


GCC 


GGC 


GAC 


CAC 


CGC 


CAG 


CCG 


GAC 


TTC 


CTC 


203 


Val 


Ser 


Met 


Ser 


Arg 


Ala 


Ala 


Gly 


Asp 


His 


Arg 


Gin 


Pro 


Asp 


Phe 


Leu 




GCC 


CGG 


AAC 


CCC 


TTC 


GGC 


CAG 


GTC 


CCC 


GTC 


CTC 


GAG 


GAC 


GGC 


GAC 


CTC 


251 


Ala 


Arg 


Asn 


Pro 


Phe 


Gly 


Gin 


Val 


Pro 


Val 


Leu 


Glu 


Asp 


Gly 


Asp 


Leu 




ACC 


CTC 


nc 


GAG 


TCG 


CGC 


GCG 


ATC 


GCG 


AGG 


CAC 


GTG 


CTC 


CGG 


AAG 


CAC 


299 



I 



-76- 

Thr Leu Phe Glu Ser Arg Ala He Ala Arg His Val Leu Arg Lys His 

AAG CCG GAG CTG CTG GGC TGC GGC TCG CCG GAG GCG GAG GCG ATG GTG 347 
Lys Pro Glu Leu Leu Gly Cys Gly Ser Pro Glu Ala Glu Ala Met Val 

GAC GTG TGG CTG GAG GTG GAG GCC CAC CAG TAG AAC CCC GCG GCC AGC 395 
Asp Val Trp Leu Glu Val Glu Ala His Gin Tyr Asn Pro Ala Ala Ser 

GCC ATC GTG GTG CAG TGC ATC ATC TTG CCG CTA CTG GGC GGC GCG CGG 443 
Ala He Val Val Gin Cys He He Leu Pro Leu Leu Gly Gly Ala Arg 

GAC CAG GCG GTG GTG GAC GAG AAC GTA GCC AAG CTC AAG AAG GTG CTG 491 
Asp Gin Ala Val Val Asp Glu Asn Val Ala Lys Leu Lys Lys Val Leu 

GAG GTG TAC GAG GCA CGG CTG TCG GCG TCC AGG TAC CTC GCC GGG GAC 539 
Glu Val Tyr Glu Ala Arg Leu Ser Ala Ser Arg Tyr Leu Ala Gly Asp 

GAC ATC AGC CTC GCC GAC CTC AGC CAC TTC CCC TTC ACG CGC TAC TTC 587 
Asp He Ser Leu Ala Asp Leu Ser His Phe Pro Phe Thr Arg Tyr Phe 

ATG GAG ACG GAG TAC GCG CCG CTG GTG GCG GAG CTC CCC CAC GTG AAC 635 
Met Glu Thr Glu Tyr Ala Pro Leu Val Ala Glu Leu Pro His Val Asn 

GCG TGG TGG GAG GGG CTC AAG GCC AGG CCG GCC GCG AGG AAG GTG ACG 683 
Ala Trp Trp Glu Gly Leu Lys Ala Arg Pro Ala Ala Arg Lys Val Thr 

GAG CTC ATG CCG CCG GAC CTT GGG CTT GGA AAG AAA GCA GAG 725 
Glu Leu Met Pro Pro Asp Leu Gly Leu Gly Lys Lys Ala Glu 

TAGTGATGAC TGCCGCCAAC GTTCACCAGG ATCGAGCAAG TCACTGTCGA GTCTCCGGTT 785 

TTGCGTTGTA CGGCACCGGG GCACC6GCCT ATATTTTCTG TACCAGTGGC TCGTGTTTTG 845 

ATGTTTTAGT CTCACGCTTG AATAAAATGC AAGATATACC CATCGGTTCT AAAAGAAAAA 905 

AAAAAAAAAA AAAAAAAAAA AAAAA 930 



(2) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Pro Ala Val Lys Val Tyr Gly Trp Ala Val Ser Pro Phe Val 
15 10 15 

Ala Arg Pro Leu Leu Cys Leu Glu Glu Ala Gly Val Glu Tyr Glu Leu 
20 25 30 

Val Ser Met Ser Arg Ala Ala Gly Asp His Arg Gin Pro Asp Phe Leu 
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35 40 45 

Ala Arg Asn Pro Phe Gly Gin Val Pro Val Leu Glu Asp Gly Asp Leu 
50 55 60 

Thr Leu Phe Glu Ser Arg Ala He Ala Arg His Val Leu Arg Lys His 
65 70 75 80 

Lys Pro Glu Leu Leu Gly Cys Gly Ser Pro Glu Ala Glu Ala Met Val 
85 90 95 

Asp Val Trp Leu Glu Val Glu Ala His Gin Tyr Asn Pro Ala Ala Ser 
100 105 110 

Ala He Val Val Gin Cys lie He Leu Pro Leu Leu Gly Gly Ala Arg 
115 120 125 

Asp Gin Ala Val Val Asp Glu Asn Val Ala Lys Leu Lys Lys Val Leu 
130 135 140 

Glu Val Tyr Glu Ala Arg Leu Ser Ala Ser Arg Tyr Leu Ala Gly Asp 
145 150 155 160 

Asp He Ser Leu Ala Asp Leu Ser His Phe Pro Phe Thr Arg Tyr Phe 
165 170 175 

Met Glu Thr Glu Tyr Ala Pro Leu Val Ala Glu Leu Pro His Val Asn 
180 185 190 

Ala Trp Trp Glu Gly Leu Lys Ala Arg Pro Ala Ala Arg Lys Val Thr 
195 200 205 

Glu Leu Met Pro Pro Asp Leu Gly Leu Gly Lys Lys Ala Glu 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 72. .707 

(ix) FEATURE: 

(A) NAME/KEY: niisc_feature 

(B) LOCATION :1. .927 

(D) OTHER INFORMATION :/note= "WIC 3/7/8 SEQUENCE AND 
ENCODED IC3 AMINO ACID SEQUENCE" 



J 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 7: 

AGCGGCTTTA CCTACCGAGA AGAAGAGAGA AAAAAGGTTC GAGTGCGTTC CAGAGTGAGG 60 

AGTGAGAAGA G ATG GCT CCG GTG AAG CTG TAC GGC GCG ACC CTG TCG TGG 110 
Met Ala Pro Val Lys Leu Tyr Gly Ala Thr Leu Sen Trp 

AAC GTC ACC AGG TGC GTG GCG GCG CTG GAG GAG GCC GGC GTC CAG TAC 158 
Asn Val Thr Arg Cys Val Ala Ala Leu Glu Glu Ala Gly Val Gin Tyr 

GAG ATC GTA CCC ATC AAC TTC GGC ACC GGC GAG CAC AAG AGC CCC GAC 206 
Glu He Val Pro He Asn Phe Gly Thr Gly Glu His Lys Ser Pro Asp 

CAC CTC GCC AGG AAC CCC TTC GGC CAG GTG CCA GCT TTG CAG GAT GGT 254 
His Leu Ala Arg Asn Pro Phe Gly Gin Val Pro Ala Leu Gin Asp Gly 

GAC TTA TAC GTC TTC GAA TCA CGT GCT ATT TGC AAG TAC GCG TGC CGC 302 
Asp Leu Tyr Val Phe Glu Ser Arg Ala He Cys Lys Tyr Ala Cys Arg 

AAG AAC AAG CCA GAG CTG TTG AAG GAG GGC GAC ATC AAG GAG TCA GCA 350 
Lys Asn Lys Pro Glu Leu Leu Lys Glu Gly Asp He Lys Glu Ser Ala 

ATG GTG GAT GTG TGG CTC GAG GTG GAG GCC CAT CAG TAC ACT GCC GCT 398 
Met Val Asp Val Trp Leu Glu Val Glu Ala His Gin Tyr Thr Ala Ala 

CTG AGC CCC ATT CTC TTC GAG TGC CTT ATC CAT CCA ATG CTT GGG GGA 446 
Leu Ser Pro He Leu Phe Glu Cys Leu He His Pro Met Leu Gly Gly 

GCC ACT GAC CAG AAG GTC ATC GAC GAC AAC CTT GTT AAG ATC AAG AAC 494 
Ala Thr Asp Gin Lys Val He Asp Asp Asn Leu Val Lys He Lys Asn 

GTG CTG GCG GTG TAC GAG GCG CAC CTG AGC AAG TCC AAG TAC CTG GCT 542 
Val Leu Ala Val Tyr Glu Ala His Leu Ser Lys Ser Lys Tyr Leu Ala 

GGA GAC TTC CTC AGT CTT GCG GAC CTT AAC CAT GTG TCT GTC ACC CTG 590 
Gly Asp Phe Leu Ser Leu Ala Asp Leu Asn His Val Ser Val Thr Leu 

TGC TTG GCG GCT ACA CCC TAT GCG TCT CTG TTC GAC GCG TAC CCG CAT 638 
Cys Leu Ala Ala Thr Pro Tyr Ala Ser Leu Phe Asp Ala Tyr Pro His 

GTG AAG GCC TGG TGG ACT GAC CTG CTG GCG AGG CCG TCC GTC CAG AAG 686 
Val Lys Ala Trp Trp Thr Asp Leu Leu Ala Arg Pro Ser Val Gin Lys 

GTC GCA GCG CTG ATG AAG CCA TGATCTTAAT TGCTGGTGCT CGTTCGTCGC 737 
Val Ala Ala Leu Met Lys Pro 



GAAATAAGCC GAGGTGTGTG CCCCCCGATG TGTGCCTGTA CGAGTGTGTG TTCTTGTGAT 797 
GTCTCCTCGT GTTGAATGTT CAGGCTTGTG CTTGCGATCC TGTCTCATCT TTTACTGAAA 857 
TGAGCGTTCC TATGCTCTGG TTTAATAATA AATTGTGCCT AGATATTATC TCAAAAAAAA 917 
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AAAAAAAAAA ^27 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Pro Val Lys Leu Tyr Gly Ala Thr Leu Ser Trp Asn Val Thr 
1 5 10 15 

Arg Cys Val Ala Ala Leu Glu Glu Ala Gly Val Gin Tyr Glu He Val 
20 25 30 

Pro He Asn Phe Gly Thr Gly Glu His Lys Ser Pro Asp His Leu Ala 
35 40 45 

Arg Asn Pro Phe Gly Gin Val Pro Ala Leu Gin Asp Gly Asp Leu Tyr 
50 55 60 

Val Phe Glu Ser Arg Ala He Cys Lys Tyr Ala Cys Arg Lys Asn Lys 
65 70 75 80 

Pro Glu Leu Leu Lys Glu Gly Asp He Lys Glu Ser Ala Met Val Asp 
85 90 95 

Val Trp Leu Glu Val Glu Ala His Gin Tyr Thr Ala Ala Leu Ser Pro 
100 105 110 

He Leu Phe Glu Cys Leu He His Pro Met Leu Gly Gly Ala Thr Asp 
115 120 125 

Gin Lys Val He Asp Asp Asn Leu Val Lys He Lys Asn Val Leu Ala 
130 135 140 

Val Tyr Glu Ala His Leu Ser Lys Ser Lys Tyr Leu Ala Gly Asp Phe 
145 150 155 160 

Leu Ser Leu Ala Asp Leu Asn His Val Ser Val Thr Leu Cys Leu Ala 
165 170 175 

Ala Thr Pro Tyr Ala Ser Leu Phe Asp Ala Tyr Pro His Val Lys Ala 
180 185 190 

Trp Trp Thr Asp Leu Leu Ala Arg Pro Ser Val Gin Lys Val Ala Ala 
195 200 205 

Leu Met Lys Pro 
210 



I 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 45. .683 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION:!. .866 

(D) OTHER INFORMATION: /note= "WIC5 SEQQUENCE AND ENCODED 
IC5 AMINO ACID SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAAGCAGGCA ACAGGCGAGC AGGAAGGAAG CAAGAGAGGT GGAG ATG GCG CCC ATC 56 

Met Ala Pro He 



AAG CTG TAC GGG ATG ATG CTG TCG GCC AAC GTG ACC CGC GTG ACC ACG 104 
Lys Leu Tyr Gly Met Met Leu Ser Ala Asn Val Thr Arg Val Thr Thr 

CTG CTC AAC GAG CTC GGC CTC GAG TTC GAC TTC GTC GAC GTC GAC CTC 152 
Leu Leu Asn Glu Leu Gly Leu Glu Phe Asp Phe Val Asp Val Asp Leu 

CGC ACC GGC GCC CAC AAG CAC CCC GAC TTC CTC AAG CTC AAC CCT TTC 200 
Arg Thr Gly Ala His Lys His Pro Asp Phe Leu Lys Leu Asn Pro Phe 

GGC CAG ATC CCC GCG CTG CAG GAC GGA GAC GAA GTT GTC TTC GAG TCG 248 
Gly Gin He Pro Ala Leu Gin Asp Gly Asp Glu Val Val Phe Glu Ser 

CGC GCC ATC AAC CGG TAC ATC GCG ACC AAG TAC GGG GCG TCC CTG CTG 296 
Arg Ala He Asn Arg Tyr He Ala Thr Lys Tyr Gly Ala Ser Leu Leu 

CCG ACG CCG TCG GCC AAG CTG GAG GCG TGG CTG GAG GTG GAG TCG CAC 344 
Pro Thr Pro Ser Ala Lys Leu Glu Ala Trp Leu Glu Val Glu Ser His 

CAC TTC TAC CCG CCG GCG CGG ACG CTG GTG TAC GAG CTG GTC ATC AAG 392 
His Phe Tyr Pro Pro Ala Arg Thr Leu Val Tyr Glu Leu Val He Lys 

CCC ATG CTG GGC GCC CCC ACC GAC GCC GCC GAG GTG GAC AAG AAC GCC 440 
Pro Met Leu Gly Ala Pro Thr Asp Ala Ala Glu Val Asp Lys Asn Ala 

GCC GAC CTC GCC AAG CTG CTC GAC GTC TAC GAG GCC CAC CTC GCC GCC 488 
Ala Asp Leu Ala Lys Leu Leu Asp Val Tyr Glu Ala His Leu Ala Ala 



-81- 



GGG 


AAC 


AAG 


TAC CTG 


GCC 


GGC 


GAC 


GCC 


TTC CCG 


CTC 


GCC 


GAC 


GCC 


AAC 


Gly 


Asn 


Lys 


Tyr Leu 


Ala 


biy 


Asp 


AT a 


ruQ riO 


Leu 


M 1 d 


Asp 


Al 
M 1 a 


Asn 


CAC 


ATG 


TCC 


TAC CTC 


TTC 


ATG 


CTC 


ACC 


AAG AGC 


CCC 


AAG 


GCG 


GAC 


CTG 


His 


Met 


Ser 


Tyr Leu 


Phe 


Met 


Leu 


Thr 


Lys Ser 


Pro 


Lys 


Ala 


Asp 


Leu 


GTG 


GCC 


TCC 


CGC CCG 


CAC 


GTC 


AAG 


GCC 


TGG TGG 


GAG 


GAG 


ATC 


TCC 


GCC 


Val 


Ala 


Ser 


Arg Pro 


His 


Val 


Lys 


Ala 


Trp Trp 


Glu 


Glu 


lie 


Ser 


Ala 


CGC 


CCC 


GCC 


TGG GCC 


AAG 


ACC 


GTC 


GCC 


TCC ATC 


CCC 


CTC 


CCG 


CCC 


GCC 


Arg 


Pro 


Ala 


Trp Ala 


Lys 


Thr 


Val 


Ala 


Ser He 


Pro 


Leu 


Pro 


Pro 


Ala 



536 



584 



632 



680 



GTC TGAGGTTGCT TGTTTGGCTG CGGCGAGAAC GGAATAAAAT CGCGATGATG 733 
Val 



GAATAAACAA CTTTTTAGAG AGGAAGCTTG GAATTCTTGG TGTTGCTGCT GTTGAATGTT 793 
GAATCHGGT GTTGAATGTT TACGGCACAT CTAATTTATC CAGTTTTTTT GGCGTGAAAA 853 
AAAAAAAAAA AAA 866 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Pro He Lys Leu Tyr Gly Met Met Leu Ser Ala Asn Val Thr 
15 10 15 

Arg Val Thr Thr Leu Leu Asn Glu Leu Gly Leu Glu Phe Asp Phe Val 
20 25 30 

Asp Val Asp Leu Arg Thr Gly Ala His Lys His Pro Asp Phe Leu Lys 
35 40 45 

Leu Asn Pro Phe Gly Gin He Pro Ala Leu Gin Asp Gly Asp Glu Val 
50 55 60 

Val Phe Glu Ser Arg Ala He Asn Arg Tyr He Ala Thr Lys Tyr Gly 
65 70 75 80 

Ala Ser Leu Leu Pro Thr Pro Ser Ala Lys Leu Glu Ala Trp Leu Glu 
85 90 95 

Val Glu Ser His His Phe Tyr Pro Pro Ala Arg Thr Leu Val Tyr Glu 
100 105 110 

Leu Val He Lys Pro Met Leu Gly Ala Pro Thr Asp Ala Ala Glu Val 
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115 120 125 

Asp Lys Asn Ala Ala Asp Leu Ala Lys Leu Leu Asp Val Tyr Glu Ala 
130 135 140 

His Leu Ala Ala Gly Asn Lys Tyr Leu Ala Gly Asp Ala Phe Pro Leu 
145 150 155 160 

Ala Asp Ala Asn His Met Ser Tyr Leu Phe Met Leu Thr Lys Ser Pro 
165 170 175 

Lys Ala Asp Leu Val Ala Ser Arg Pro His Val Lys Ala Trp Trp Glu 
180 185 190 

Glu He Ser Ala Arg Pro Ala Trp Ala Lys Thr Val Ala Ser He Pro 
195 200 205 

Leu Pro Pro Ala Val 
210 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 15. .668 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION :1. .897 

(D) OTHER INFORMATION :/note= "WIC4 SEQUENCE AND ENCODED 
IC4 AMINO ACID SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AACCAAGGGA AACA ATG GCG CCG GTG AAG GTG TTC GGG CCG GCG ATG TCG 50 
Met Ala Pro Val Lys Val Phe Gly Pro Ala Met Ser 

ACC AAC GTG GCC CGG GTG CTG GTG TGC CTG GAG GAG GTC GGC GCC GAG 98 
Thr Asn Val Ala Arg Val Leu Val Cys Leu Glu Glu Val Gly Ala Glu 

TAC GAG GTG GTC GAC ATC GAT TTC AAG GCC ATG GAG CAC AAG AGC CCC 146 
Tyr Glu Val Val Asp He Asp Phe Lys Ala Met Glu His Lys Ser Pro 

GAG CAT CTC GTC AGA AAC CCG TTC GGC CAA ATC CCT GCC TTC CAG GAT 194 
Glu His Leu Val Arg Asn Pro Phe Gly Gin He Pro Ala Phe Gin Asp 
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GGG 


GAT 


GTG 


CTT 


CTL 


I 1 L 


PAP 

bAb 


TP A 

1 LA 


LbL 


PP A 

bbA 


ATT 
A 1 1 


rrr 

bLb 


Abb 


TAP 
1 ML 


r;Tr; 


PTP 
L 1 L 


9A9 


Gly 


Asp 


Leu 


Leu 


Leu 


Phe 


Glu 


Ser 


Arg 


Ala 


He 


Ala 


Arg 


Tyr 


Val 


Leu 




CGC 


AAG 


TAG 


AAG 


AAG 


AAC 


P A A 

GAA 


PXP 

GTb 


PAP 

bAL 


PTP 

Lib 


PTP 

Lib 


APP 

Abb 


PA A 

bAA 


rrr 
bbL 


PAP 
bAL 


PTP 
L 1 L 


c.^\J 


Arg 


Lys 


Tyr 


Lys 


Lys 


Asn 


Glu 


Val 


Asp 


Leu 


Leu 


Arg 


Glu 


Gly 


Asp 


Leu 




AAG 


GAG 


GCG 


GCG 


ATG 


GTG 


GAC 


GTA 


TPP 

Tbb 


A PP 

ALb 


PAP 

bAb 


PTP 

bib 


PAP 

bAL 


rrr 
bLb 


PAT 
LAL 


APP 
ALL 


OoO 




Glu 


Ala 


Ala 


Met 


Val 


Asp 


Val 


Trp 


Thr 


Glu 


Val 


Asp 


Ala 


His 


Thr 




TAG 


AAC 


GCG 


GCC 


ATC 


TCG 


CCG 


ATP 

ATC 


PTP 

GTG 


TAP 

TAL 


PAP 

bAb 


TPP 

IbL 


TP A 

1 LA 


TP A 

1 LA 


APP 

ALL 


PPT 
bL 1 




Tyr 


Asn 


Pro 


Ala 


He 


Ser 


Pro 


He 


Val 


Tyr 


Glu 


Cys 


Ser 


Ser 


Thr 


Ala 




CAT 


GCG 


CGG 


GTG 


CCG 


ACC 


AAC 


CAA 


A PP 

ACG 


GIG 


PTP 

bib 


PAP 

bAL 


PAP 

bAb 


APP 

AbL 


PTP 
L 1 b 


PAf^ 
bAb 




His 


Ala 


Arg 


Leu 


Pro 


Thr 


Asn 


Gin 


Thr 


Val 


Val 


Asp 


Glu 


Ser 


Leu 


Glu 




AAG 


CTC 


AAG 


AAC 


GTG 


CTG 


GAG 


GTC 


TAG 


GAG 


GCG 


CGC 


CTG 


TCC 


A A/^ 

AAG 


A P 

CAC 




Lvs 


Leu 


Lys 


Asn 


Val 


Leu 


Glu 


Val 


Tyr 


Glu 


Ala 


Arg 


Leu 


Ser 


Lys 


His 




GAC 


TAG 


CTC 


GCC 


GGG 


GAC 


TTC 


GTC 


AGC 


TTC 


GCG 


GAC 


^TP 

CTC 


A AP 

AAL 


PAP 
LAL 


TTP 
1 1 L 


can 


Asp 


Tvr 


Leu 


Ala 


Gly 


Asp 


Phe 


Val 


Ser 


Phe 


Ala 


Asp 


Leu 


Asn 


His 


Phe 




CGC 


TAG 


ACC 


TTC 


TAG 


TTC 


ATG 


GCC 


ACG 


CCG 


CAC 


GCG 


GCC 


CTC 


TTC 


GAC 


578 


Pro 


Tyr 


Thr 


Phe 


Tyr 


Phe 


Met 


Ala 


Thr 


Pro 


His 


Ala 


Ala 


Leu 


Phe 


Asp 




TCG 


TAG 


CCG 


CAC 


GTG 


AAG 


GCC 


TGG 


TGG 


GAG 


AGG 


ATC 


ATG 


GCG 


AGG 


CCG 


626 


Sen 


Tyr 


Pro 


His 


Val 


Lys 


Ala 


Trp 


Trp 


Glu 


Arg 


lie 


Met 


Ala 


Arg 


Pro 




GCC 


GTG 


AAG 


AAG 


CTC 


GCC 


GCG 


CAG 


ATG 


GTT 


CCG 


AAG 


AAG 


CCG 






668 


Ala 


Val 


Lys 


Lys 


Leu 


Ala 


Ala 


Gin 


Met 


Val 


Pro 


Lys 


Lys 


Pro 









TGATTTGCTA GGCGGGATCT CGCATCGTGG GATCCGATTC CGATCACTGA TCTGTGTGGC 728 



6TTTTCTTTT CTTGTTGGTG TCGCGAATAA GGCAAATGAG CTCGTGTGTG TGTGGCTGGA 788 
ATTGCACCAG CGTGCAGTTT TTGCGCTTTG CGTGTGTGTG GTCGTGAAAA CTCTTGAGAT 848 
GGAACAATGT CTTCGTAATG CTTTCACATT TTAAAAAAAA AAAAAAAAA 897 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Pro Val Lys Val Phe Gly Pro Ala Met Ser Thr Asn Val Ala 
15 10 15 

Arg Val Leu Val Cys Leu Glu Glu Val Gly Ala Glu Tyr Glu Val Val 
20 25 30 



i 
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Asp He Asp Phe Lys Ala Met Glu His Lys Ser Pro Glu His Leu Val 
35 40 45 

Arg Asn Pro Phe Gly Gin He Pro Ala Phe Gin Asp Gly Asp Leu Leu 
50 55 60 

Leu Phe Glu Ser Arg Ala He Ala Arg Tyr Val Leu Arg Lys Tyr Lys 
65 70 75 80 

Lys Asn Glu Val Asp Leu Leu Arg Glu Gly Asp Leu Lys Glu Ala Ala 
85 90 95 

Met Val Asp Val Trp Thr Glu Val Asp Ala His Thr Tyr Asn Pro Ala 
100 105 110 

He Ser Pro He Val Tyr Glu Cys Ser Ser Thr Ala His Ala Arg Leu 
115 120 125 

Pro Thr Asn Gin Thr Val Val Asp Glu Ser Leu Glu Lys Leu Lys Asn 
130 135 140 

Val Leu Glu Val Tyr Glu Ala Arg Leu Ser Lys His Asp Tyr Leu Ala 
145 150 155 160 

Gly Asp Phe Val Ser Phe Ala Asp Leu Asn His Phe Pro Tyr Thr Phe 
165 170 175 

Tyr Phe Met Ala Thr Pro His Ala Ala Leu Phe Asp Ser Tyr Pro His 
180 185 190 

Val Lys Ala Trp Trp Glu Arg He Met Ala Arg Pro Ala Val Lys Lys 
195 200 205 

Leu Ala Ala Gin Met Val Pro Lys Lys Pro 
210 215 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 21.. 686 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .721 
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(D) OTHER INFORMATION: /note= "TA 27 SEQUENCE AND ENCODED 
AMINO ACID SEQUENCE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

nCGGCACGA GGAAGAA6GG ATG GAG CCT ATG AAG GTG TAC GGC TGG GCG 50 

Met Glu Pro Met Lys Val Tyr Gly Trp Ala 

GTG TCG CCA TGG ATG GCG CGG GTC CTC GTC TCC CTG GAG GAG GCC GGC 98 
Val Ser Pro Trp Met Ala Arg Val Leu Val Ser Leu Glu Glu Ala Gly 

GCC GAC TAC GAG CTC GTG CCC ATG AGC CGC AAC GGC GGC GAC CAC CGG 146 
Ala Asp Tyr Glu Leu Val Pro Met Ser Arg Asn Gly Gly Asp His Arg 

CGG CCG GAG CAC CTC GCC AGA AAC CCC TIC GGT GAG ATC CCG GTG CTC 194 
Arg Pro Glu His Leu Ala Arg Asn Pro Phe Gly Glu He Pro Val Leu 

GAA TAC GGC GGT CTG ACG CTT TAC CAA TCC CGC GCC ATI GCA AGG CAT 242 
Glu Tyr Gly Gly Leu Thr Leu Tyr Gin Ser Arg Ala He Ala Arg His 

ATT CTC CGC AAA CAC AAG CCC GGG CTT CTA GGA GCA GGC AGC CTC GAG 290 
He Leu Arg Lys His Lys Pro Gly Leu Leu Gly Ala Gly Ser Leu Glu 

GAG TCG GCG ATG GTG GAT GTA TGG GTC GAC GTG GAT GCC CAC CAC CTG 338 
Glu Ser Ala Met Val Asp Val Trp Val Asp Val Asp Ala His His Leu 

GAG CCC GTA CTC AAG CCC ATC GTG TGG AAC TGC ATC ATC AAC CCG TTC 386 
Glu Pro Val Leu Lys Pro He Val Trp Asn Cys He He Asn Pro Phe 

GTC GGG AGG GAC GTC GAC CAG GGC CTC GTC GAT GAG AGC GTC GAG AAG 434 
Val Gly Arg Asp Val Asp Gin Gly Leu Val Asp Glu Ser Val Glu Lys 

CTC AAG AAG CTG CTG GAG GTG TAC GAG GCA AGA CTG TCA AGC AAC AAG 482 
Leu Lys Lys Leu Leu Glu Val Tyr Glu Ala Arg Leu Ser Ser Asn Lys 

TAC TTG GCC GGG GAT TTC GTC AGC TTC GCC GAC CTC ACC CAT TTC TCC 530 
Tyr Leu Ala Gly Asp Phe Val Ser Phe Ala Asp Leu Thr His Phe Ser 

TTC ATG CGC TAC TTC ATG GCG ACG GAG CAT GCG GTT GTG CTC GAT GCG 578 
Phe Met Arg Tyr Phe Met Ala Thr Glu His Ala Val Val Leu Asp Ala 

TAT CCG CAT GTG AAG GCA TGG TGG AAG GCG CTG CTG GCA AGG CCA TCG 626 
Tyr Pro His Val Lys Ala Trp Trp Lys Ala Leu Leu Ala Arg Pro Ser 

GTC AAG AAG GTG ATA GCT GGC ATG CCT CCG GAT TTT GGA TTC GGG AGC 674 
Val Lys Lys Val He Ala Gly Met Pro Pro Asp Phe Gly Phe Gly Ser 

GGG AGA ATA CCA TGATAAAGCA TGCTTGTTTG TCTATGATGC TCTGA 721 
Gly Arg He Pro 



(2) INFORMATION FOR SEQ ID NO: 14: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Glu Pro Met Lys Val Tyr Gly Trp Ala Val Ser Pro Trp Met Ala 
15 10 15 

Arg Val Leu Val Ser Leu Glu Glu Ala Gly Ala Asp Tyr Glu Leu Val 
20 25 30 

Pro Met Ser Arg Asn Gly Gly Asp His Arg Arg Pro Glu His Leu Ala 
35 40 45 

Arg Asn Pro Phe Gly Glu He Pro Val Leu Glu Tyr Gly Gly Leu Thr 
50 55 60 

Leu Tyr Gin Ser Arg Ala lie Ala Arg His He Leu Arg Lys His Lys 
65 70 75 80 

Pro Gly Leu Leu Gly Ala Gly Ser Leu Glu Glu Ser Ala Met Val Asp 
85 90 95 

Val Trp Val Asp Val Asp Ala His His Leu Glu Pro Val Leu Lys Pro 
100 105 110 

He Val Trp Asn Cys He He Asn Pro Phe Val Gly Arg Asp Val Asp 
115 120 125 

Gin Gly Leu Val Asp Glu Ser Val Glu Lys Leu Lys Lys Leu Leu Glu 
130 135 140 

Val Tyr Glu Ala Arg Leu Ser Ser Asn Lys Tyr Leu Ala Gly Asp Phe 
145 150 155 160 

Val Ser Phe Ala Asp Leu Thr His Phe Ser Phe Met Arg Tyr Phe Met 
165 170 175 

Ala Thr Glu His Ala Val Val Leu Asp Ala Tyr Pro His Val Lys Ala 
180 185 190 

Trp Trp Lys Ala Leu Leu Ala Arg Pro Ser Val Lys Lys Val He Ala 
195 200 205 

Gly Met Pro Pro Asp Phe Gly Phe Gly Ser Gly Arg He Pro 
210 215 220 



CLAIMS 



1. A polynucleotide encoding a glutathione transferase 
(GST) subunit, which polynucleotide comprises a coding 
sequence capable of hybridising selectively to the coding 
sequence of SEQ ID No . 1, 3, 5, 7, 9, 11 or 13 or to the 
complement of one of those sequences. 

2. A polynucleotide of claim 1 which is a DNA 
sequence . 

3. A polynucleotide according to claim 1 or 2 wherein 
the coding sequence encodes the amino acid sequence of 
SEQ ID No. 2, 4, 6, 8, 10, 12 or 14 . 

4. A polynucleotide according to any one of the 
preceding claims which comprises the coding sequence of 
SEQ ID No. 1, 3, 5, 1, 9, 11 or 13 or a fragment thereof. 

5 . A polypeptide which is a GST subunit and comprises 
the amino acid sequence of SEQ ID No. 2, 4, 6, 8, 10, 12 
or 14 or a sequence substantially homologous thereto, or 
a fragment of either said sequence. 

6 . A polypeptide according to claim 5 encoded by the 
coding sequence of a polynucleotide according to any one 
of claims 1 to 4 . 

7. A dimeric protein comprising two GST subunits, 
wherein at least one subunit is a polypeptide according 
to claim 5 or 6 . 

8. A chimeric gene comprising a polynucleotide 
according to any one of claims 1 to 4 operably linked to 
regulatory sequences that allow expression of the coding 
sequence in a host cell . 
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9 . A chimeric gene according to claim 7 wherein the 
regulatory sequences allow expression of the coding 
sequence in a plant cell, 

10. A vector comprising a polynucleotide according to 
any one of claims 1 to 4 or a chimeric gene according to 
claim 8 or 9. 

11. A vector according to claim 10 which is an 
expression vector . 

12 . A cell transformed or transf ected with a vector 
according to claim 10 or 11. 

13. A cell according to claim 12 which is a prokaryotic 
cell or a plant cell, 

14. A cell having, integrated into its genome, a 
chimeric gene according to claim 8 or 9 . 

15. A cell according to claim 14 which is a plant cell, 
wherein the chimeric gene is a chimeric gene according to 
claim 9 . 

16. A cell according to any one of claims 12 to 15 
further comprising one or more further polynucleotide 
sequences coding for a GST subunit, operably linked to 
regulatory elements that allow expression of the subunit 
in the cell . 

17. A process for the production of a polypeptide 
according to claim 5 or 6, which process comprises: 

(a) cultivating a cell according to any one of claims 12 
to 15 under conditions that allow the expression of the 
polypeptide; and 
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(b) recovering the expressed polypeptide. 

18 . A process for the production of a dimeric protein 
according to claim 1, which process comprises: 

(a) cultivating a cell according to any one of claims 
12 to 16 under conditions that allow: 

(i) the expression of the polypeptide according to 
claim 5 or 6 and, if a further polynucleotide sequence as 
defined in claim 16 is present, optionally the expression 
of a further GST subunit encoded by a further 
polynucleotide , and 

(ii) the association of the GST subunit polypeptide 
according to claim 5 or 6 with another GST subunit 
polypeptide to form a dimeric protein according to claim 
7 ; and 

(b) recovering the dimeric protein so formed. 

19. A process according to claim 17 or 18 wherein the 
cell is a prokaryotic cell or a plant cell. 

20. A method of obtaining a transgenic plant cell 
comprising : 

(a) transforming a plant cell with an expression vector 
according to claim 11 to give a transgenic plant cell, 

and optionally, 

(a*) transforming the cell with one or more further 
polynucleotide sequences coding for a GST subunit, 
operably linked to regulatory elements that allow 
expression of the subunit in the cell. 

21. A method of obtaining a first-generation transgenic 
plant comprising: 
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(b) regenerating a transgenic plant cell transformed 
with a vector according to claim 11 to give a transgenic 
plant . 

22. A method of obtaining a transgenic plant seed 
comprising : 

(c) obtaining a transgenic seed from a transgenic plant 
obtainable by step (b) of claim 21, 

23 . A method of obtaining a transgenic progeny plant 
comprising obtaining a second-generation transgenic 
progeny plant from a first-generation transgenic plant 
obtainable by a method according to claim 21, and 
optionally obtaining transgenic plants of one or more 
further generations from the second-generation progeny 
plant thus obtained. 

24. A method according to claim 23 comprising: 

(c) obtaining a transgenic seed from a first-generation 
transgenic plant obtainable by the method according to 
claim 21, then obtaining a second-generation transgenic 
progeny plant from the transgenic seed; 

and/or 

(d) propagating clonally a first -generation transgenic 
plant obtainable by the method according to claim 21 to 
give a second-generation progeny plant; 

and/or 

(e) crossing a first-generation transgenic plant 
obtainable by a method according to claim 21 with another 
plant to give a second-generation progeny plant; 
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and optionally; 

(f) obtaining transgenic progeny plants of one or more 
further generations from the second-generation progeny 
plant thus obtained. 

25. A transgenic plant cell, first -generation plant, 
plant seed or progeny plant obtainable by a method 
according to any one of claims 20 to 24 . 

26. A transgenic plant or plant seed comprising plant 
cells according to claim 13 or 15. 

27. A transgenic plant cell callus comprising plant 
cells according to claim 13 or 15, or obtainable from a 
transgenic plant cell, first-generation plant, plant seed 
or progeny plant according to claim 25. 

28. Use of a polynucleotide according to any one of 
claims 1 to 4 as a selectable marker for detecting 
transformation of a plant cell. 

29. A nucleic acid construct comprising: 

(a) a polynucleotide according to any one of claims 1 
to 4 operably linked to regulatory elements that allow 
expression of the coding sequence in a plant cell; and 

(b) a site into which a further polynucleotide 
comprising a coding sequence can be inserted. 

30. A nucleic acid construct according to claim 29 
wherein site (b) is bounded by regulatory elements that 
allow expression of a coding sequence inserted at the 
site in a plant cell. 



31. A vector comprising a construct according to claim 
29 . 

32. A method of transforming a plant cell or of 
obtaining a plant cell culture or transgenic plant 
comprising : 

(a) providing an untransf ormed plant cell which is 
susceptible to a herbicide whose herbicidal activity is 
reduced by a dimeric protein according to claim 7; 

(b) transforming the plant cell with a vector according 
to claim 29 or 30; 

(c) cultivating the transformed cell under conditions 
that allow the expression of the polynucleotide (a) in 
the construct according to claim 29 or 30; and/or 

(c*) regenerating the cell to give a cell culture or 
plant such that the polynucleotide (a) in the construct 
according to claim 29 or 30 is expressed; and 

(d) contacting the cell, cell culture or plant with the 
herbicide whose herbicidal activity is reduced by the 
dimeric protein according to claim 7, and to which the 
untransf ormed plant cell was susceptible; and 

(e) selecting cells, cell cultures or plants that are 
less susceptible to the herbicide than are corresponding 
untransf ormed cells, cell cultures or plants. 

33. Use of a dimeric protein according to claim 7 in a 
method of identifying compounds capable of metabolism by 
a GST. 

34. A method of identifying compounds capable of being 
metabolised by a glutathione transferase comprising: 
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(a) contacting a candidate compound suspected of being 
capable of being metabolised by glutathione transferase 
with glutathione (GSH) in the presence of a dimeric 
protein according to claim 7; and 

(b) determining whether or not metabolism of the 
candidate compound takes place. 

35. A method according to claim 34 wherein 
metabolism of the compound is detected by determining 
whether or not it is conjugated to glutathione by the 
dimeric protein according to claim 7. 

36- A kit for detecting compounds capable of being 
metabolised by a GST comprising: 

(a) reduced glutathione or homoglutathione ; 
and 

(b) a dimeric protein according to claim 7. 

37 . An antibody which specifically recognises a 
polypeptide according to claim 5 or 6 or a dimeric 
protein according to claim 7 . 

38. A nucleic acid probe which selectively hybridises 
to the sequence of SEQ ID No. 1, 3, 5, 7, 9, 11 or 13 . 

39. A method of identifying compounds that induce GST 
expression in graminaceous plants comprising: 

(a) contacting a graminaceous plant, or a cell or cell 
culture thereof, with a candidate compound suspected of 
being capable of inducing GST expression; and 

(b) determining the level of GST expression in the 
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plant, cell or cell culture. 

40. A method according to claim 39 wherein, in step 
(b) , the level of expression is determined by: (i) 
determining the level of GST protein present by using an 
antibody according to claim 35; or (ii) determining the 
level of GST mRNA present using a probe according to 
claim 37. 

41. A kit for identifying compounds that induce 

GST expression in plants by a method as defined in claim 
37 or 38, which kit comprises an antibody as defined in 
claim 36 . 

42. A method of determining the GST level in a sample 
of seed or flour comprising: 

(i) determining the level of GST protein 
present by using an antibody according to 
claim 35; or 

(ii) determining the level of GST mRNA present 
using a probe according to claim 37. 

43. A method of controlling the growth of weeds at a 
locus where a transgenic plant according to any one of 
claims 25 to 27 is being cultivated, which method 
comprises applying to the locus a herbicide whose 
herbicidal properties are reduced by a dimeric protein 
according to claim 7 . 

44 . A compound identified by a method according to 
any one of claims 34, 35, 39 or 40. 

45. A polynucleotide according to claim 1 

substantially as hereinbefore described with reference to 
any one of the preceding Examples . 



46. A polypeptide according to claim 5 

substantially as hereinbefore described with reference to 
any one of the preceding Examples. 

47. A dimeric protein according to claim 7 
substantially as hereinbefore described with reference to 
any one of the preceding Examples. 

48. A chimeric gene according to claim 8 

substantially as hereinbefore described with reference to 
any one of the preceding Examples. 

49. A vector according to claim 10 substantially 

as hereinbefore described with reference to any one of 
the preceding Examples . 

50. A cell according to claim 12 substantially as 
hereinbefore described with reference to any one of the 
preceding Examples . 

51. A process according to claim 17 or 18 

substantially as hereinbefore described with reference to 
any one of the preceding Examples . 

52. A method according to claims 20, 21, 22 or 23 
substantially as hereinbefore described with reference to 
any one of the preceding Examples . 

53. A transgenic plant cell, first-generation 

plant, plant seed or progeny plant, plant or plant seed, 
or plant cell callus according to any one of claims 25 to 
27 substantially as hereinbefore described with reference 
to any one of the preceding Examples. 

54 . Use according to claim 28 substantially as 
hereinbefore described with reference to any one of the 
preceding Examples . 
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55. A nucleic acid construct according to claim 29 
substantially as hereinbefore described with reference to 
any one of the preceding Examples. 

56. A vector according to claim 31 substantially 

as hereinbefore described with reference to any one of 
the preceding Examples. 

57. A method according to claim 32 substantially 

as hereinbefore described with reference to any one of 
the preceding Examples . 

58. Use according to claim 33 substantially as 
hereinbefore described with reference to any one of the 
preceding Examples. 

59. A method according to claim 34 substantially 

as hereinbefore described with reference to any one of 
the preceding Examples. 

60. An antibody according to claim 37 

substantially as hereinbefore described with reference to 
any one of the preceding Examples. 

61. A nucleic acid probe according to claim 38 
substantially as hereinbefore described with reference to 
any one of the preceding Examples . 

62. A method according to claims 39, 42 or 43 
substantially as hereinbefore described with reference to 
any one of the preceding Examples or 42. 

63 . A compound according to claim 44 substantially 
as hereinbefore described with reference to any one of 
the preceding Examples. 
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ABSTRACT 

NEW PLANT GENES 

This invention relates to glutathione transferase 
(GST) subunits to nucleic acid sequences encoding 
glutathione transferase subunits , and to uses of these 
glutathione transferases and coding sequences , especially 
in the field of plant biotechnology. 
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